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11C and 11D. 

6. On October 1 8, 2004, the USPTO sent an Updated Filing Receipt to Applicant which lists 
an incorrect filing date of September 22, 2004. 

7. Applicants respectfully assert that the Updated Filing Receipt, mailed on October 18, 
2004, listed the filing date of the instant Application as September 22, 2004 in error and 
request that this filing date be corrected to January 28, 2004. 

8. The September 22, 2004 preliminary amendment made the following changes to the 
specification: 

8.1 The specification was amended to include a sequence listing. This sequence listing 
does not contain new material, as attested by Applicant in a "Statement Under 37 
C.F.R. §1.821 in Support of Submission of the Sequence Listing" filed on 
September 22, 2004. 

8.2 The specification was amended to better describe Figure 19 in the section entitled 
"Brief Description of the Figures" (page 8 line 24). The amended text is supported 
by the text shown in Figure 19 of the original application filed on January 28, 2004. 

8.3 The replacement Formal Figures contain Figures 11C and 11D. These figures are 
not novel for at least two reasons. First, Figures 1 1C and 1 ID were included as part 
of the original patent application filed on January 28, 2004. Second, the 
specification of the instant application (U.S. Patent Application Serial Number 
10/767,779) specifically incorporated by reference Provisional Application Serial 
No. 60/476,592 on page 1, line 11. Therefore, the disclosure of Provisional 
Application Serial No. 60/476,592 must be considered part of the originally filed 
application. Support for Figure 1 1C and 1 ID may be found in Figures 1 1C and 1 ID 
of Provisional Application Serial No. 60/476,592. For these two reasons, the 
replacement Formal Figures does not contain new matter. 

9. For the reasons stated above, Applicants respectfully asserts that the Preliminary 
Amendment of September 22, 2004 did not introduce new matter into the Application and 
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MAIL STOP PETITION 

Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



PETITION TO CHANGE FILING DATE 



Dear Sir: 



1 . Applicants request that the filing date of September 22, 2004 set forth on the Filing 
Receipt mailed by the U.S. Patent and Trademark Office on October 18, 2004, be 
corrected to January 28, 2004. A copy of the Filing Receipt (Exhibit A) is enclosed. The 
statements below are made as to the exact facts that are relied upon to establish that the 
Application was filed on January 28, 2004. 

2. On June 6, 2003, Applicants' filed Provisional Application Serial No. 60/476,592 which 
contains, among other things, Figures 1 1C and 1 ID. 

3. On January 28, 2004, Applicants filed U.S. Patent Application Serial No. 10/767,779 
which also contains Figures 11C and 11D. The USPTO acknowledged receipt of this 
Application with all figures (41 pages comprising figures 1-42). Patent Application Serial 

No. 10/767,779 also incorporated by reference Provisional Application Serial No. A 
60/476,592 on page 1 , line 11. § 



4. On June 22, 2004, the USPTO issued a Notice to File Missing Parts, alleging that the s 

to 

filed application was missing Figures 1 1C and 1 ID. § 

§ 

5. On September 22, 2004, Applicants submitted a preliminary amendment which replaced || 

CO 

the original Drawings with Formal Drawings. The Formal Drawings included Figures « 
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hence, the change of the filing date of the instant application to September 22, 2004 was 
made in error. 

10. Under the requirements set forth in 37 C.F.R. §1.181, the following documents are 
submitted herewith in support of this Petition: 

a. a copy of an Updated Filing Receipt for U.S. Patent Application No. 10/767,779, 
mailed October 18, 2004 which lists the Filing Date of the Application as 
September 22, 2004 (Exhibit A). 

b. a copy of Provisional Application Serial No. 60/476,592, bearing Attorney Docket 
number 21465-514 (Exhibit B). 

c. a copy of a Filing Receipt of Provisional Application Serial No. 60/476,592, mailed 
July 15, 2003, showing Attorney Docket Number 21465-514 (Exhibit C), the filing 
date of the Provisional Application as June 6, 2003 and the U.S. Patent Application 
Serial No. 60/476,592. 

d. a copy of the Express Mail Label: EV262659147US, used for filing Provisional 
Application Serial No. 60/476,592, bearing Attorney Docket number 21465-514 
(Exhibit D), showing the filing date of the Provisional Application as June 6, 2003. 

e. a copy of Express Mail Label: EV399526138US, used for filing U.S. Patent 
Application No. 10/767,779, showing Patent Application, Attorney Docket No. 
21465-510, including a date stamp indicating date of mailing as January 28, 2004 
(Exhibit E); 

f a copy of the Return Postcard mailed with Applicants' Patent Application, bearing 
Attorney Docket number 21465-510, including a date stamp indicating a date of 
receipt by U.S. Patent Office of January 28, 2004 and the U.S. Patent Application 
Serial No. 10/767,779 (Exhibit F). 

g. a copy of the specification of U.S. Patent Application Serial No. 10/767,779 as filed 
on January 28, 2004 (Exhibit G). 

h. a copy of the Notice to File Missing Parts for U.S. Patent Application Serial No. 
10/767,779 (Exhibit H). 

i. a copy of the preliminary amendment filed on September 22, 2004 for U.S. Patent 
Application Serial No. 10/767,779 (Exhibit I). 

j. a copy of a "Statement Under 37 C.F.R. §1.821 in Support of Submission of the 
Sequence Listing" filed on September 22, 2004 for U.S. Patent Application Serial 
No. 10/767,779 (Exhibit J). 
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1 1 . Accordingly, Applicants respectfully submit that the Updated Filing Receipt mailed October 
18, 2004 is in error, and request that the filing date be corrected to January 28, 2004. 

12. Pursuant to 37 C.F.R. §1. 17(f), a check for $400.00 covering the petition fee is enclosed. 
Should any additional fee be due, the Commissioner is hereby authorized to charge such 
fee, or credit any overpayment, to Deposit Account Number: 50-0311, Customer Number: 
35437, Reference Number: 21465-510 UTIL. Because the Updated Filing Receipt 
mailed October 18, 2004 is in error, Applicants hereby respectfully request a refund of the 
petition fee. A duplicate copy of this petition is enclosed. 

13. Should the Commissioner have any questions concerning this petition, he is invited to 
telephone the undersigned at the number provided. 

Respectfully submitted, 

Ivor R. Elrifi, Reg. No. 39,529 
Michelle A. Klein, Reg. No. 55,296 
Attorneys/Agents for Applicants 
c/o MINTZ, LEVIN, et al. 
666 Third Avenue-24' h Floor 
New York, New York 10017 
Telephone: (212)983-3000 
Telefax: (212)983-3115 
Tel: (212)692-6296 
Fax: (212)983-3115 



Dated: March 8, 2006 
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METHODS OF AMPLIFYING AND SEQUENCING NUCLEIC ACIDS 

RELATED APPLICATIONS 

This application claims the benefit of priority to the following applications: 

USSN 60/443,471 filed January 29, 2003, USSN 60/465,071 filed April 23, 2003. 

5 This application incorporates by reference the following copending U.S. patent 

applications: "Method for Preparing Single Stranded Nucleic Acid Libraries", filed 
June 6, 2003, by Costa et aL "Bead Emulsion Nucleic Acid Amplification," by Berka et 
al., filed on June 6, 2003 and with a serial number yet to be assigned; and "Double 
Ended Sequencing " filed on June 6, 2003, by Ronan et al, a serial number yet to be 
10 assigned. 

All patent and patent applications referred to in this application are hereby fully 
incorporated by reference. 



FIELD OF THE INVENTION 

1 5 This invention relates to a method and apparatus for determining the base 

sequences of DNA. More particularly, this invention relates to methods and an apparatus 
with which the base sequences of a genome can be amplified and determined by 
automatically or semiautomatically. 

20 BACKGROUND OF THE INVENTION 

Development of rapid and sensitive nucleic acid sequencing methods utilizing 
automated DNA sequencers has revolutionized modem molecular biology. Analysis of 
entire genomes of plants, fungi, animals, bacteria, and viruses is now possible with a 
concerted effort by a series of machines and a team of technicians. However, the goal of 

25 rapid and automated or semiautomatic sequencing of a genome in a short time has not 
been possible. There continues to be technical problems for accurate sample preparation, 
amplification and sequencing. 

One technical problem which hinders sequence analysis of genomes has been the 
inability of the investigator to rapidly prepare numerous nucleic acid sample 

30 encompassing a complete genome in a short period of time. 

1 
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Another technical problem is the inability to representatively amplified a genome 
to a level that is compatible with the sensitivity of current sequencing methods. Modem 
economically feasible sequencing machines, while sensitive, still require in excess of one 
million copies of a DNA fragment for sequencing. Current methods for providing high 
5 copies of DNA sequencing involves variations of cloning or in vitro amplification which 
cannot amplify the number of individual clones (600,000 or more, and tens of millions 
for a human genome) necessary for sequencing a whole genome economically. 

Yet another technical problem in the limitation of current sequencing methods 
which can perform, at most, one sequencing reaction per hybridization of oligonucleotide 
10 primer. The hybridization of sequencing primers is often the rate limiting step 
constricting the throughput of DNA sequencers. 

Because of all these technical problems, DNA sequencers continue to be slower 
and more expensive than would be desired. In the pure research setting it is perhaps 
acceptable if a sequencer is slow and expensive. But when it is desired to use DNA 
15 sequencers in a clinical diagnostic setting such inefficient sequencing methods are 
prohibitive even for a well financed institution. 

Accordingly, it is clear that there exists a need for preparation of a genome or 
large template nucleic acids for sequencing, for amplification of the nucleic acid 
template, and for the sequencing of the amplified template nucleic acids without the 
20 constraint of one sequencing reaction per hybridization. Furthermore, there is a need for 
a system to connect these various technologies into a viable automatic or semiautomatic 
sequencing machine. 

BRIEF SUMMARY OF THE INVENTION 

25 This invention describes an integrated system, comprising novel methods and 

novel apparatus for (1) nucleic acid sample preparation, (2) nucleic acid amplification, 

and (3) DNA sequencing. 

The invention provides a novel method for preparing a library of multiple DNA 

sequences, particularly derived from large template DNA or whole (or partial) genome 
30 DNA. Sequences of single stranded DNA are prepared from a sample of large template 

DNA or whole or partial DNA genomes through fragmentation, polishing, adaptor 
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ligation, nick repair, and isolation of single stranded DNA. The method provides for 
generating a ssDNA library linked to solid supports comprising: (a) generating a library 
of ssDNA templates; (b) attaching the ssDNA templates to solid supports; and (c) 
isolating the solid supports on which one ssDNA template is attached. 
5 The invention also provides for a method of amplifying each individual member 

of a DNA library in a single reaction tube, by, e.g., encapsulating a plurality of DNA 
samples individually in a microcapsule of an emulsion, performing amplification of the 
plurality of encapsulated nucleic acid samples simultaneously, and releasing said 
amplified plurality of DNA from the microcapsules for subsequent reactions. In one 

10 embodiment, single copies of the nucleic acid template species are hybridized to DNA 
capture beads, suspended in complete amplification solution and emulsified into 
microreactors (typically 100 to 200 microns in diameter), afier which amplification (e.g., 
PCR) is used to clonally increase copy number of the initial template species to more 
than 1,000,000 copies of a single nucleic acid sequence, preferably between 2 and 20 

15 million copies of a single nucleic acid. The amplification reaction, for example, may be 
performed simultaneously with at least 3,000 microreactors per microliter of reaction 
mix, and may be performed with over 300,000 microreactors in a single 100^1 volume 
test tube (e.g., a PCR reaction tube). The present invention also provides for a method of 
enriching for those beads that contains a successful DNA amplification event (i.e., by 

20 removing beads that have no DNA attached thereto). 

The invention also provides for a method of sequencing a nucleic acid from 
multiple primers with a single primer hybridization step. Two or more sequencing 
primers are hybridized to the template DNA to be sequenced. All the sequencing primers 
are then protected except for one. Sequencing (e.g., pyrophosphate sequencing) is 

25 performed again by elongating the unprotected primer. The elongation is either allowed 
to go to completion (with additional polymerase and dNTPs if necessary) or is terminated 
(by polymerase and ddNTPs). Chain completion and/or termination reagents are 
removed. Then one of the protected primers is unprotected and sequencing is performed 
by elongating the newly unprotected primer. This process is continued until all the 

30 sequencing primers are deprotected and sequenced. In a preferred embodiment, two 
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primers (one protected and one unprotected) are used to sequence both ends of a double 
stranded nucleic acid. 

The invention also provides an apparatus and methods for sequencing nucleic 
acids using a pyrophosphate based sequencing approach. The apparatus has a charge 
5 coupled device (CCD) camera, microfluidics chamber, sample cartridge holder, pump 
and flow valves. The apparatus uses chemiluminescence as the detection method, which 
for pyrophosphate sequencing has an inherently low background. In a preferred 
embodiment, the sample cartridge for sequencing is termed the TicoTiter plate, 1 and is 
formed from a commercial fiber optics faceplate, acid-etched to yield hundreds of 

10 thousands of very small wells, each well volume of 75 pL. The apparatus includes a 
novel reagent delivery cuvette adapted for use with the arrays described herein, to 
provide fluid reagents to the picotiter plate, and a reagent delivery means in 
communication with the reagent delivery cuvette. Photons from each well on the 
picotiter plate are channeled into specific pixels on the CCD camera to detect sequencing 

15 reactions. 



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a schematic representation of the entire process of library 
20 preparation including the steps of template DNA fragmentation (Figure 

1A), end polishing (Figure IB), adaptor ligation (Figure 1C), nick repair, 
strand extension and gel isolation (Figure ID). Figure 1 also depicts a 
representative agarose gel containing a sample preparation of a 180-350 
base pair adenovirus DNA library according to the methods of this 
25 invention. 

Figure 2 depicts a schematic representation of the universal adaptor design 
according to the present invention. Each universal adaptor is generated 
from two complementary ssDNA oligonucleotides that are designed to 
contain a 20 bp nucleotide sequence for PCR priming, a 20 bp nucleotide 
30 sequence for sequence pruning and a unique 4 bp discriminating sequence 
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comprised of a non-repeating nucleotide sequence {i.e., ACGT, CAGT, 
etc.). 

Figure 3 depicts the strand displacement and extension of nicked double-stranded 
DNA fragments according to the present invention. Following the ligation 
5 of universal adaptors generated from synthetic oligonucleotides, double- 

stranded DNA fragments will be generated that contain two nicked regions 
following T4 DNA ligase treatment (Figure 3A). The addition of a strand 
displacing enzyme (i.e., Bst DNA polymerase I) will bind nicks (Figure 
3B), strand displace the nicked strand and complete nucleotide extension 

10 of the strand (Figure 3C) to produce non-nicked double-stranded DNA 

fragments (Figure 3D). 
Figure 4 depicts the isolation of directionally-ligated single-stranded DNA 
according to the present invention using streptavidin-coated beads. 
Following ligation with universal adaptors A and B (the two different 

15 adaptors are sometimes referred to as a "first" and "second" universal 

adaptor), double-stranded DNA will contain adaptors in four possible 
combinations: AA, BB, AB and BA. When universal adaptor B contains 
a 5*-biotin, magnetic streptavidin-coated solid supports are used to capture 
and isolate the AB, BA and BB populations (population AA is washed 

20 away). The BB population is retained on the beads as each end of the 

double-stranded DNA is attached to a bead and is not released. However, 
upon washing in the presence of a low salt buffer, only populations AB 
and BA will release a single-stranded DNA fragment that is 
complementary to the bound strand. Single-stranded DNA fragments are 

25 isolated from the supernatant and used as template for subsequent 

amplification and sequencing. This method is described below in more 
detail. 

Figure 5 depicts a schematic of the structure of a DNA capture bead. 
Figure 6 depicts a schematic of one embodiment of a bead emulsion amplification 
30 process. 
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Figure 7 depicts a schematic of an enrichment process to remove beads that do not 

have any DNA attached thereto. 
Figure 8 depicts a schematic representation of the double ended sequencing 

reaction according to the present invention (two pages, Fig. 8A and 8B). 
5 Figure 9 depicts a double-ended sequencing demonstration on a pyrosequencing 

apparatus from 454 Coip. 
Figure 10 depicts an exemplary double ended sequencing process. 
Figure UA-D depicts schematic illustrations of rolling circle-based amplification using 

an anchor primer. 

10 Figure 12 depicts a drawing of a sequencing apparatus according to the present 
invention. 

Figure 13 depicts a drawing of a reagent delivery / perfusion chamber according to 
the present invention. 

Figure 14 depicts a micrograph of a cavitated fiber optic bundle, termed a picotiter 
15 plate, of the present invention. 

Figure 15 depicts a micrograph of a picotiter plate "carpeted" with beads having 

DNA template immobilized thereon and sulfurylase and luciferase 

immobilized thereon. 

Figure 16 depicts a schematic illustration of the reagent flow chamber and FORA 
20 (Picotiter plate). 

Figure 17 depicts a diagram of the analytical instrument of the present invention. 
Figure 18 is a schematic illustration of microscopic parallel sequencing reactions 

within a picotiter plate. 
Figure 19 is a micrograph of single well reactions. 



25 



DETAILED DESCRIPTION OF THE INVENTION 



The present invention is directed to a method and apparatus for performing 
genomic sequencing which satisfies the objectives of (1) preparing a nucleic acid (e.g., a 
30 genome) in a rapid and efficient manner for sequencing, (2) amplifying the nucleic acid 
in a representative manner, and (3) performing multiple sequencing reactions with only 
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one primer hybridization. The present invention is particularly suited for genotyping, 
detection and diagnosis from a small sample of nucleic acid in a cost efficient manner. 
Each of these objectives are listed below. 
Definitions: 

5 Unless otherwise defined, all technical and scientific terms used herein have the 

same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Methods and materials similar or equivalent to those described herein 
can be used in the practice of the present invention, and exemplified suitable methods and 
materials are described below. For example, methods may be described which comprise 

10 more than two steps. In such methods, not all steps may be required to achieve a defined 
goal and the invention envisions the use of isolated steps to achieve these discrete goals. 
The disclosures of all publications, patent applications, patents and other references are 
incorporated in toto herein by reference. In addition, the materials, methods, and 
examples arc illustrative only and not intended to be limiting. 

15 As used herein, the term "universal adaptor" refers to two complementary and 

annealed oligonucleotides that are designed to contain a nucleotide sequence for PCR 
priming and a nucleotide sequence for sequence priming. Optionally, the universal 
adaptor may further include a unique discriminating key sequence comprised of a non- 
repeating nucleotide sequence (/.<?., ACGT, CAGT, etc.). A set of universal adaptors 

20 comprises two unique and distinct double-stranded sequences that can be ligated to the 
ends of double-stranded DNA. Therefore, the same universal adaptor or different 
universal adaptors can be ligated to either end of the DNA molecule. When comprised in 
a larger DNA molecule that is single stranded or when present as an oligonucleotide, the 
universal adaptor may be referred to as a single stranded universal adaptor. 

25 "Target DNA" shall mean a DNA whose sequence is to be determined by the 

methods and apparatus of the invention. 

Binding pair shall mean a pair of molecules that interact by means of specific 
non-covalent interactions that depend on the three-dimensional structures of the 
molecules involved. Typical pairs of specific binding partners include antigen-antibody, 

30 hapten-antibody, hormone-receptor, nucleic acid strand-complementary nucleic acid 
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strand, substrate-enzyme, substrate analog-enzyme, inhibitor-enzyme, carbohydrate- 
lectin, biotin-avidin, and virus-cellular receptor. 

As used herein, the term "discriminating key sequence" refers to a sequence 
consisting of at least one of each of the four deoxyribonucleotides (i.e., A, C, G, T). The 
5 same discriminating sequence can be used for an entire library of DNA fragments. 
Alternatively, different discriminating key sequences can be used to track libraries of 
DNA fragments derived from different organisms. 

As used herein, the term "plurality of molecules" refers to DNA isolated from the 
same source, whereby different organisms may be prepared separately by the same 
10 method. In one embodiment, the plurality of DNA samples is derived from large 

segments of DNA, whole genome DNA, cDNA, viral DNA or from reverse transcripts of 
viral RNA. This DNA may be derived from any source, including mammal (i.e., human, 
nonhuman primate, rodent or canine), plant, bird, reptile, fish, fungus, bacteria or virus. 

As used herein, the term "library" refers to a subset of smaller sized DNA species 
15 generated from a single DNA template, either segmented or whole genome. 

As used herein, the term "unique", as in "unique PCR priming regions" refers to a 
sequence that docs not exist or exists at an extremely low copy level within the DNA 
molecules to be amplified or sequenced. 

As used herein, the term "compatible" refers to an end of double stranded DNA to 
20 which an adaptor molecule may be attached (i.e., blunt end or cohesive end). 

As used herein, the term "fragmenting" refers to a process by which a larger 
molecule of DNA is converted into smaller pieces of DNA. 

As used herein, "large template DNA" would be DNA of more than 25kb, 
preferably more than 500kb, more preferably more than 1MB, and most preferably 5MB 
25 or larger. 

As used herein, the term "stringent hybridization conditions" refers to those 
conditions under which only complimentary sequences will hybridize to each other. 

The invention described here is generally a system and methods for processing 
30 nucleic acids. The system and methods can be used to process nucleic acids in a 
multitude of ways that utilize sequencing of nucleic acids. Such sequencing can be 



8 



E/M: EV262659I47US 



21465-514 



performed to determine the identity of a sequence of nucleic acids, or for single 
nucleotide polymorphism detection in nucleic acid fragments, for nucleic acid expression 
profiling (comparing the nucleic acid expression profile between two or more states - 
e.g., comparing between diseased and normal tissue or comparing between untreated 
5 tissue and tissue treated with drug, enzymes, radiation or chemical treatment), for 

haplotying (comparing genes or variations in genes on each of the two alleles present in a 
human subject), for karyotyping (diagnostically comparing one or more genes in a test 
tissue - typically from an embryo/fetus prior to conception to detect birth defects — with 
the same genes from "normal" karyotyped subjects), and for genotyping (comparing one 
10 or more genes in a first individual of a species with the same genes in other individuals of 
the same species). 

The system has a number of components. These include (1) the nucleic acid 
template that is to be processed, (2) a picotiter plate for containing the nucleic acid 
template, (3) a flow chamber and fluid delivery means that permits flow of nucleic acid 

1 5 processing reagents over the nucleic acid template where the processing reagents 

generate light as the nucleic acid is processed, (4) a light capture means that detects light 
emitted as the nucleic acid is processed and that converts the captured light into data, and 
(5) data processing means that processes the data to yield meaningful information about 
the nucleic acid that has been processed. Each of these components of the system will be 

20 discussed in detail below. 

1. NUCLEIC ACID TEMPLATE AND PREPARATION THEREOF 

Nucleic Acid Templates 

The nucleic acid templates that can be sequenced according to the invention, e.g., 
25 a nucleic acid library, in general can include open circular or closed circular nucleic acid 
molecules. A "closed circle" is a covalently closed circular nucleic acid molecule, e.g., a 
circular DNA or RNA molecule. An "open circle" is a linear single-stranded nucleic acid 
molecule having a 5' phosphate group and a 3' hydroxyl group. 

In one embodiment, the single stranded nucleic acid contains at least 100 copies 
30 of a specific nucleic acid sequence, each copy covalently linked end to end. In some 
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embodiments, (he open circle is formed in situ from a linear double-stranded nucleic acid 
molecule. The ends of a given open circle nucleic acid molecule can be ligated by DNA 
ligase. Sequences at the 5' and 3' ends of the open circle molecule are complementary to 
two regions of adjacent nucleotides in a second nucleic acid molecule, e.g., an adapter 
5 region of an anchor primer (sometimes referred to as an adapter), or to two regions that 
are nearly adjoining in a second DNA molecule. Thus, the ends of the open-circle 
molecule can be ligated using DNA ligase, or extended by DNA polymerase in a gap- 
filling reaction. Open circles are described in detail in Lizardi, U.S. Pat. No. 5,854,033, 
fully incorporated herein by reference. An open circle can be converted to a closed circle 

10 in the presence of a DNA ligase (for DNA) or RNA ligase following, e.g., annealing of 
the open circle to an anchor primer. 

If desired, nucleic acid templates can be provided as padlock probes. Padlock 
probes are linear oligonucleotides that include target-complementary sequences located at 
each end, and which are separated by a linker sequence. The linkers can be ligated to 

15 ends of members of a library of nucleic acid sequences that have been, e.g., physically 
sheared or digested with restriction endonucleases. Upon hybridization to a target- 
sequence, the 5 - and 3-terminal regions of these linear oligonucleotides are brought in 
juxtaposition. This juxtaposition allows the two probe segments (if properly hybridized) 
to be covalently-bound by enzymatic ligation (e.g., with T4 DNA ligase), thus converting 

20 the probes to circularly-closed molecules which are catenated to the specific target 

sequences (see e.g., Nilsson, et ai, 1994. Science 265: 2085-2088). The resulting probes 
are suitable for the simultaneous analysis of many gene sequences both due to their 
specificity and selectivity for gene sequence variants (see e.g., Lizardi, et at., 1998. Nat. 
Genet. 19: 225-232; Nilsson, et al t 1997. Nat. Genet. 16: 252-255) and due to the fact 

25 that the resulting reaction products remain localized to the specific target sequences. 
Moreover, intramolecular ligation of many different probes is expected to be less 
susceptible to non-specific cross-reactivity than multiplex PCR-based methodologies 
where non-cognate pairs of primers can give rise to irrelevant amplification products (see 
e.g., Landegren and Nilsson, 1997. Ann. Med. 29: 585-590). 

30 A starting nucleic acid template library can be constructed comprising cither 

single-stranded or double-stranded nucleic acid molecules, provided that the nucleic acid 
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sequence includes a region that, if present in the library, is available for annealing, or can 
be made available for annealing, to an anchor primer sequence. For example, when used 
as a template for rolling circle amplification, a region of a double-stranded template 
needs to be at least transiently single-stranded in order to act as a template for extension 
5 of the anchor primer. 

Library templates can include multiple elements, including, but not limited to, one 
or more regions that are complementary to the anchor primer. For example, the template 
libraries may include a region complementary to a sequencing primer, a control 
nucleotide region, and an insert sequence comprised of the sequencing template to be 
10 subsequently characterized. As is explained in more detail below, the control nucleotide 
region is used to calibrate the relationship between the amount of byproduct and the 
number of nucleotides incorporated. As utilized herein the term "complement 0 refers to 
nucleotide sequences that are able to hybridize to a specific nucleotide sequence to form a 
matched duplex. 

15 In one embodiment, a library template includes: (/) two distinct regions that are 

complementary to the anchor primer, (if) one region homologous to the sequencing 
primer, (//'/) one optional control nucleotide region, (/v) an insert sequence of, e.g., 30- 
500, 50-200, or 60-100 nucleotides, that is to be sequenced. The template can, of course, 
include two, three, or all four of these features. 

20 The template nucleic acid can be constructed from any source of nucleic acid, 

any cell, tissue, or organism, and can be generated by any art-recognized method. 
Suitable methods include, e.g., sonication of genomic DNA and digestion with one or 
more restriction endonucleases (RE) to generate fragments of a desired range of lengths 
from an initial population of nucleic acid molecules. Preferably, one or more of the 

25 restriction enzymes have distinct four-base recognition sequences. Examples of such 
enzymes include, e.g., Sau3Al, Mspl, and Taql. Preferably, the enzymes are used in 
conjunction with anchor primers having regions containing recognition sequences for the 
corresponding restriction enzymes. In some embodiments, one or both of the adapter 
regions of the anchor primers contain additional sequences adjoining known restriction 

30 enzyme recognition sequences, thereby allowing for capture or annealing to the anchor 
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primer of specific restriction fragments of interest to the anchor primer. In other 
embodiments, the restriction enzyme is used with a type IIS restriction enzyme. 

Alternatively, template libraries can be made by generating a complementary 
DNA (cDNA) library from RNA, e.g., messenger RNA (mRNA). The cDNA library can, 
5 if desired, be further processed with restriction endonucleases to obtain a 3' end 

characteristic of a specific RNA, internal fragments, or fragments including the 3' end of 
the isolated RNA. Adapter regions in the anchor primer may be complementary to a 
sequence of interest that is thought to occur in the template library, e.g., a known or 
suspected sequence polymorphism within a fragment generated by endonuclease 
10 digestion. 

In one embodiment, an indexing oligonucleotide can be attached to members of a 
template library to allow for subsequent correlation of a template nucleic acid with a 
population of nucleic acids from which the template nucleic acid is derived. For 
example, one or more samples of a starting DNA population can be fragmented 

15 separately using any of the previously disclosed methods (e.g., restriction digestion, 
sonication). An indexing oligonucleotide sequence specific for each sample is attached 
to, e.g., ligated to, the termini of members of the fragmented population. The indexing 
oligonucleotide can act as a region for circularization, amplification and, optionally, 
sequencing, which permits it to be used to index, or code, a nucleic acid so as to identify 

20 the starting sample from which it is derived. 

Distinct template libraries made with a plurality of distinguishable indexing 
primers can be mixed together for subsequent reactions. Determining the sequence of the 
member of the library allows for the identification of a sequence corresponding to the 
indexing oligonucleotide. Based on this information, the origin of any given fragment 

25 can be inferred. 

The invention includes a sample preparation process that results in a solid or a 
mobile solid substrate array containing a plurality of anchor primers or adapaters 
covalently linked to template nucleic acids. 

When the template nucleic acid is circular, formation of the covalently linked 

30 anchor primer and one or more copies of the target nucleic acid preferably occurs by 
annealing the anchor primer to a complementary region of a circular nucleic acid, and 
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then extending the annealed anchor primer with a polymerase to result in formation of a 
nucleic acid containing one or more copies of a sequence complementary to the circular 
nucleic acid. 

Attachment of the anchor primer to a solid or mobile solid substrate can occur 
5 before, during, or subsequent to extension of the annealed anchor primer. Thus, in one 
embodiment, one or more anchor primers are linked to the solid or a mobile solid 
substrate, after which the anchor primer is annealed to a target nucleic acid and extended 
in the presence of a polymerase. Alternatively, in a second embodiment, an anchor 
primer is first annealed to a target nucleic acid, and a 3'OH terminus of the annealed 
10 anchor primer is extended with a polymerase. The extended anchor primer is then linked 
to the solid or mobile solid substrate. By varying the sequence of anchor primers, it is 
possible to specifically amplify distinct target nucleic acids present in a population of 
nucleic acids. 

Outlined below is a preferred embodiment for the preparation of template nucleic 

15 acids for amplification and sequencing reactions. The invention includes a method for 
preparing the sample DNA comprised of seven general steps: (a) fragmenting large 
template DNA or whole genomic DNA samples to generate a plurality of digested DNA 
fragments; (b) creating compatible ends on the plurality of digested DNA samples; (c) 
ligating a set of universal adaptor sequences onto the ends of fragmented DNA molecules 

20 to make a plurality of adaptor-ligated DNA molecules, wherein each universal adaptor 
sequence has a known and unique base sequence comprising a common PCR primer 
sequence, a common sequencing primer sequence and a discriminating four base key 
sequence and wherein one adaptor is attached to biotin; (d) separating and isolating the 
plurality of ligated DNA fragments; (e) removing any portion of the plurality of ligated 

25 DNA fragments; (f) nick repair and strand extension of the plurality of ligated DNA 
fragments; (g) attaching each of the ligated DNA fragments to a solid support; and (h) 
isolating populations comprising single-stranded adaptor-ligated DNA fragments for 
which there is a unique adaptor at each end (i.e., providing directionality). 

The following discussion summarizes the basic steps involved in the methods of 

30 the invention. The steps are recited in a specific order, however, as would be known by 
one of skill in the art, the order of the steps may be manipulated to achieve the same 
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result. Such manipulations are contemplated by the inventors. Further, some steps may 
be minimized as would also be known by one of skill in the art. 

Fragmentation 

5 In the practice of the methods of the present invention, the fragmentation of the 

DNA sample can be done by any means known to those of ordinary skill in the art. 
Preferably, the fragmenting is performed by enzymatic or mechanical means. The 
mechanical means may be sonication or pnysical shearing. The enzymatic means may be 
performed by digestion with nucleases (e.g., Deoxyribonuclease I (DNase I)) or one or 

10 more restriction endonucleases. In a preferred embodiment, the fragmentation results in 
ends for which the sequence is not known. 

In a preferred embodiment, the enzymatic means is DNase I. DNase I is a 
versatile enzyme that nonspecifically cleaves double-stranded DNA (dsDNA) to release 
5'-phosphorylated di-, tri-, and oligonucleotide products. DNase I has optimal activity in 

1 5 buffers containing Mn 2+ , Mg 2+ and Ca 2+ , but no other salts. The purpose of the DNase I 
digestion step is to fragment a large DNA genome into smaller species comprising a 
library. The cleavage characteristics of DNase I will result in random digestion of 
template DNA (i.e., no sequence bias) and in the predominance of blunt-ended dsDNA 
fragments when used in the presence of manganese-based buffers (Melgar, E. and D.A. 

20 Goldthwait. 1968. Deoxyribonucleic acid nucleases. II. The effects of metal on the 
mechanism of action of deoxyribonuclease I. J. Biol Chem. 243: 4409). The range of 
digestion products generated following DNase I treatment of genomic templates is 
dependent on three factors: i) amount of enzyme used (units); ii) temperature of digestion 
(°C); and iii) incubation time (minutes). The DNase I digestion conditions outlined 

25 below have been optimized to yield genomic libraries with a size range from 50-700 base 
pairs (bp). 

In a preferred embodiment, the DNase I digests large template DNA or whole 
genome DNA for 1-2 minutes to generate a population of polynucleotides. In another 
preferred embodiment, the DNase I digestion is performed at a temperature between 
30 10°C-37°C. In yet another preferred embodiment, the digested DNA fragments are 
between 50 bp to 700 bp in length. 
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Polishing 

Digestion of genomic DNA (gDNA) templates with DNase I in the presence of 
Mn 2+ will yield fragments of DNA that are either blunt-ended or have protruding termini 
5 with one or two nucleotides in length. In a preferred embodiment, an increased number 
of blunt ends are created with Pfu DNA polymerase. In other embodiments, blunt ends 
can be created with less efficient DNA polymerases such as T4 DNA polymerase or 
Klenow DNA polymerase. Pfu "polishing" or blunt ending is used to increase the 
amount of blunt-ended species generated following genomic template digestion with 

10 DNase I. Use of Pfu DNA polymerase for fragment polishing will result in the fill-in of 
5' overhangs. Additionally, Pfu DNA polymerase does not exhibit DNA extendasc 
activity but does have 3'-> 5' exonuclease activity that will result in the removal of 
single and double nucleotide extensions to further increase the amount of blunt-ended 
DNA fragments available for adaptor ligation (Costa, G.L. and MP. Weiner. 1994a. 

15 Protocols for cloning and analysis of blunt-ended PCR-generated DNA fragments. PCR 
Methods Appl 3(5):S95; Costa, G.L., A. Grafsky and MP. Weiner. 1994b. Cloning and 
analysis of PCR-generated DNA fragments. PCR Methods Appl 3(6):338; Costa, G.L. 
and M.P. Weiner. 1 994c. Polishing with T4 or Pfu polymerase increases the efficiency 
of cloning of PCR products. Nucleic Acids Res. 22(12):2423). 

20 

Adaptor Ligation 

If the libraries of nucleic acids are to be attached to the solid substrate, then 
preferably the nucleic acid templates are annealed to anchor primer sequences using 
recognized techniques (see, e.g., Hatch, et ai, 1999. Genet. Anal. Biomol. Engineer. 15: 

25 35-40; Kool, U.S. Patent No. 5,714, 320 and Lizardi, U.S. Patent No. 5,854,033). In 
general, any procedure for annealing the anchor primers to the template nucleic acid 
sequences is suitable as long as it results in formation of specific, i.e., perfect or nearly 
perfect, complementarity between the adapter region or regions in the anchor primer 
sequence and a sequence present in the template library. 

30 In a preferred embodiment, following fragmentation and blunt ending of the DNA 

library, universal adaptor sequences are added to each DNA fragment. The universal 
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adaptors are designed to include a set of unique PCR priming regions that are typically 20 
bp in length located adjacent to a set of unique sequencing priming regions that are 
typically 20 bp in length optionally followed by a unique discriminating key sequence 
consisting of at least one of each of the four deoxyribonucleotides (i.e., A, C, G, T). In a 
5 preferred embodiment, the discriminating key sequence is 4 bases in length. In another 
embodiment, the discriminating key sequence may be combinations of 1 -4 bases. In yet 
another embodiment, each unique universal adaptor is forty-four bp (44 bp) in length. In 
a preferred embodiment the universal adaptors are ligated, using T4 DNA ligase, onto 
each end of the DNA fragment to generate a total nucleotide addition of 88 bp to each 
10 DNA fragment. Different universal adaptors are designed specifically for each DNA 
library preparation and will therefore provide a unique identifier for each organism. The 
size and sequence of the universal adaptors may be modified as would be apparent to one 
of skill in the art. 

For example, to prepare two distinct universal adaptors (i.e., "first" and 

15 "second"), single-stranded oligonucleotides may be ordered from a commercial vendor 
(<".e., Integrated DNA Technologies, IA or Operon Technologies, CA). In one 
embodiment, the universal adaptor oligonucleotide sequences are modified during 
synthesis with two or three phosphorothioate linkages in place of phosphodiester linkages 
at both the 5' and 3' ends. Unmodified oligonucleotides are subject to rapid degradation 

20 by nucleases and are therefore of limited utility. Nucleases are enzymes that catalyze the 
hydrolytic cleavage of a polynucleotide chain by cleaving the phosphodiester linkage 
between nucleotide bases. Thus, one simple and widely used nuclease-resistant 
chemistry available for use in oligonucleotide applications is the phosphorothioate 
modification. In phosphorothioates, a sulfur atom replaces a non-bridging oxygen in the 

25 oligonucleotide backbone making it resistant to all forms of nuclease digestion (i.e. 
resistant to both endonuclease and exonuclease digestion). Each oligonucleotide is 
HPLC-purified to ensure there are no contaminating or spurious oligonucleotide 
sequences in the synthetic oligonucleotide preparation. The universal adaptors are 
•designed to allow directional ligation to the blunt-ended, fragmented DNA. Each set of 

30 double-stranded universal adaptors are designed with a PCR priming region that contains 
noncomplementary 5' four-base overhangs that cannot ligate to the blunt-ended DNA 



16 



EM: EV262659147US 



21465-514 



fragment as well as prevent ligation with each other at these ends. Accordingly, binding 
can only occur between the 3' end of the adaptor and the 5 ? end of the DNA fragment or 
between the 3' end of the DNA fragment and the 5' end of the adaptor. Double-stranded 
universal adaptor sequences are generated by using single-stranded oligonucleotides that 
5 are designed with sequences that allow primarily complimentary oligonucleotides to 
anneal, and to prevent cross-hybridization between two non-complimentary 
oligonucleotides. In one embodiment, 95% of the universal adaptors are formed from the 
annealing of complimentary oligonucleotides. In a preferred embodiment, 97% of the 
universal adaptors are formed from the annealing of complimentary oligonucleotides. In 

10 a more preferred embodiment, 99% of the universal adaptors are formed from the 

annealing of complimentary oligonucleotides. In a most preferred embodiment, 100% of 
the universal adaptors are formed from the annealing of complimentary oligonucleotides. 

One of the two adaptors can be linked to a support binding moiety. In a preferred 
embodiment, a 5' biotin is added to the first universal adaptor to allow subsequent 

15 isolation of ssDNA template and noncovalent coupling of the universal adaptor to the 
surface of a solid support that is saturated with a biotin-binding protein (i.e. streptavidin, 
neutravidin or avidin). Other linkages arc well known in the art and may be used in place 
of biotin-streptavidin (for example antibody/antigen-epitope, receptor/1 igand and 
oligonucleotide pairing or complimentarity) one embodiment, the solid support is a bead, 

20 preferably a polystyrene bead. In one preferred embodiment, the bead has a diameter of 
about 2.8 nm. As used herein, this bead is referred to as a "sample prep bead". 

Each universal adaptor may be prepared by combining and annealing two ssDNA 
oligonucleotides, one containing the sense sequence and the second containing the 
antisense (complementary) sequence. Schematic representation of the universal adaptor 

25 design is outlined in Figure 2. 

Isolation of Ligation Products 

The universal adaptor ligation results in the formation of fragmented DNAs with 
adaptors on each end, unbound single adaptors, and adaptor dimers. In a preferred 
30 embodiment, agarose gel electrophoresis is used as a method to separate and isolate the 
adapted DNA library population from the unligated single adaptors and adaptor dimer 
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populations. In other embodiments, the fragments may be separated by size exclusion 
chromatography or sucrose sedimentation. The procedure of DNase I digestion of DNA 
typically yields a library population that ranges from 50-700 bp. In a preferred 
embodiment, upon conducting agarose gel electrophoresis in the presence of a DNA 
5 marker, the addition of the 88 bp universal adaptor set will shift the DNA library 
population to a larger size and will result in a migration profile in the size range of 
approximately 130-800 bp; adaptor dimers will migrate at 88 bp; and adaptors not ligated 
will migrate at 44 bp. Therefore, numerous double-stranded DNA libraries in sizes 
ranging from 200-800 bp can be physically isolated from the agarose gel and purified 
10 using standard gel extraction techniques. In one embodiment, gel isolation of the adapted 
ligated DNA library will result in the recovery of a library population ranging in size 
from 200-400 bp. Other methods of distinguishing adaptor-ligated fragments are known 
to one of skill in the art. 

15 Nick Repair 

Because the DNA oligonucleotides used for the universal adaptors are not 5' 
phosphorylated, gaps will be present at the 3' junctions of the fragmented DNAs 
following ligase treatment (see Figure 3 A). These two "gaps" or "nicks" can be filled in 
by using a DNA polymerase enzyme that can bind to, strand displace and extend the 

20 nicked DNA fragments. DNA polymerases that lack 3'-> 5' exonuclease activity but 
exhibit 5' -> 3' exonuclease activity have the ability to recognize nicks, displace the 
nicked strands, and extend the strand in a manner that results in the repair of the nicks 
and in the formation of non-nicked double-stranded DNA (see Figure 3B and 3C) 
(Hamilton, S.C., J.W. Farchaus and M.C. Davis. 2001. DNA polymerases as engines for 

25 biotechnology. BioTechniques 31:370). 

Several modifying enzymes are utilized for the nick repair step, including but not 
limited to polymerase, ligase and kinase. DNA polymerases that can be used for this 
application include, for example, E. coli DNA pol 1, Thennoanaerohacter 
thermohydrosulfuricus pol I, and bacteriophage phi 29. In a preferred embodiment, the 

30 strand displacing enzyme Bacillus stearothermophihis pol I (Bst DNA polymerase I) is 
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used to repair the nicked dsDNA and results in non-nicked dsDNA (see Figure 3D). In 
another preferred embodiment, the ligase is T4 and the kinase is polynucleotide kinase. 



Isolation of Single-Stranded DNA 

5 Following the generation of non-nicked dsDNA, ssDNAs comprising both the 

first and second adaptor molecules are to be isolated (desired populations are designated 
below with asterisks; "A" and "B" correspond to the first and second adaptors). Double- 
stranded DNA libraries will have adaptors bound in the following configurations: 



1 0 Universal Adaptor A - DNA fragment - Universal Adaptor A 
Universal Adaptor B - DNA fragment - Universal Adaptor A* 
Universal Adaptor A - DNA fragment - Universal Adaptor B* 
Universal Adaptor B - DNA fragment - Universal Adaptor B 



15 Universal adaptors are designed such that only one universal adaptor has a 5' 

biotin moiety. For example, if universal adaptor B has a 5'biotin moiety, streptavidin- 
coated sample prep beads can be used to bind all double-stranded DNA library species 
with universal adaptor B. Genomic library populations that contain two universal adaptor 
A species will not contain a 5' biotin moiety and will not bind to streptavidin-containing 

20 sample prep beads and thus can be washed away. The only species that will remain 

attached to beads are those with universal adaptors A and B and those with two universal 
adaptor B sequences. DNA species with two universal adaptor B sequences (i.e., biotin 
moieties at each 5 'end) will be bound to streptavidin-coated sample prep beads at each 
end, as each strand comprised in the double strand will be bound. Double-stranded DNA 

25 species with a universal adaptor A and a universal adaptor B will contain a single 5'biotin 
moiety and thus will be bound to streptavidin-coated beads at only one end. The sample 
prep beads are magnetic, therefore, the sample prep beads will remain coupled to a solid 
support when magnetized. Accordingly, in the presence of a low-salt ("melt" or 
denaturing) solution, only those DNA fragments that contain a single universal adaptor A 

30 and a single universal adaptor B sequence will release the complementary unbound 
strand. This single-stranded DNA population may be collected and quantitated by, for 

19 
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example, pyrophosphate sequencing, real-time quantitative PCR, agarose gel 
electrophoresis or capillary gel electrophoresis. 

Attachment of Template to Beads 

5 In one embodiment, ssDNA libraries that are created according to the methods of 

the invention are quantitated to calculate the number of molecules per unit volume. These 
molecules are annealed to a solid support (bead) that contain oligonucleotide capture 
primers that are complementary to the PCR priming regions of the universal adaptor ends 
of the ssDNA species. Beads are then transferred to an amplification protocol. Clonal 

10 populations of single species captured on DNA beads may then sequenced. In one 
embodiment, the solid support is a bead, preferably a sepharose bead. As used herein, 
this bead is referred to as a "DNA capture bead". 

The beads used herein may be of any convenient size and fabricated from any 
number of known materials. Example of such materials include: inorganics, natural 

15 polymers, and synthetic polymers. Specific examples of these materials include: 
cellulose, cellulose derivatives, acrylic resins, glass; silica gels, polystyrene, gelatin, 
polyvinyl pyrrolidone, co-polymcrs of vinyl and acrylamide, polystyrene cross-linked 
with divinylbenzenc or the like (see, Merrifield Biochemistry 1964, 3, 1385-1390), 
polyacrylamidcs, latex gels, polystyrene, dextran, rubber, silicon, plastics, nitrocellulose, 

20 celluloses, natural sponges, silica gels, glass, metals plastic, cellulose, cross-linked 
dextrans (e.g., Sephadex™) and agarose gel (Sepharose™) and solid phase supports 
known to those of skill in the art. In one embodiment, the diameter of the DNA capture 
bead is in the range of 20-70|im. In a preferred embodiment, the diameter of the DNA 
capture bead is in a range of 20-50|am. In a more preferred embodiment, the diameter of 

25 the DNA capture bead is about 30|im. 

In one aspect, the invention includes a method for generating a library of solid 
supports comprising: (a) preparing a population of ssDNA templates according to the 
methods disclosed herein; (b) attaching each DNA template to a solid support such that 
there is one molecule of DNA per solid support; (c) amplifying the population of single- 

30 stranded templates such that the amplification generates a clonal population of each DNA 
fragment on each solid support; (d) sequencing clonal populations of beads. 

20 
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In one embodiment, the solid support is a DNA capture bead. In another 
embodiment, the DNA is genomic DNA, cDNA or reverse transcripts of viral RNA. The 
DNA may be attached to the solid support, for example, via a biotin-streptavidin linkage, 
a covalent linkage or by complementary oligonucleotide hybridization. In one 
5 embodiment, each DNA template is ligated to a set of universal adaptors. In another 
embodiment, the universal adaptor pair comprises a common PCR primer sequence, a 
common sequencing primer sequence and a discriminating key sequence. Single- 
stranded DNAs are isolated that afford unique ends; single stranded molecules are then 
attached to a solid support and exposed to amplification techniques for clonal expansion 

1 0 of populations. The DNA may be amplified by PCR. 

In another aspect, the invention provides a library of solid supports made by the 
methods described herein. 

The nucleic acid template (e.g., DNA template) prepared by this method may be 
used for many molecular biological procedures, such as linear extension, rolling circle 

15 amplification, PCR and sequencing. This method can be accomplished in a linkage 
reaction, for example, by using a high molar ratio of bead to DNA. Capture of single- 
stranded DNA molecules will follow a poisson distribution and will result in a subset of 
beads with no DNA attached and a subset of beads with two molecules of DNA attached. 
In a preferred embodiment, there would be one bead to one molecule of DNA. In 

20 addition, it is possible to include additional components in the adaptors that may be 
useful for additional manipulations of the isolated library. 

2. NUCLEIC ACID TEMPLATE AMPLIFICATION 

In order for the nucleic acid template to be sequenced according to the methods of 
25 this invention the copy number must be amplified to generate a sufficient number of 
copies of the template to produce a delectable signal by the light detection means. Any 
suitable nucleic acid amplification means may be used. 

A number of in vitro nucleic acid amplification techniques have been described. 
These amplification methodologies may be differentiated into those methods: (/) which 
30 require temperature cycling - polymerase chain reaction (PCR) (see e.g., Saiki, et ai, 

1995. Science 230: 1350-1354), ligase chain reaction (see e.g.. Barany, 1991. Proc. Natl. 
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Acad. Set. USA 88: 189-193; Barringer, et al., 1990. Gene 89: 1 17-122) and 
transcription-based amplification (see e.g., Kwoh, et ai, 1989. Proc. Natl. Acad ScL 
USA 86: 1 173-1 177) and (//) isothermal amplification systems - self-sustaining, sequence 
replication (see e.g., Guatelli, et ai, 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878); 
5 the QP replicase system (see e.g.. Lizardi, et al, 1988. BioTechnology 6: 1 197-1202); 
strand displacement amplification Nucleic Acids Res. 1992 Apr 1 1;20(7): 1691-6.; and 
the methods described in PNAS 1992 Jan l;89(l):392-6; and NASBA J Virol Methods. 
1991 Dec;35(3):273-86. 

In one embodiment, isothermal amplification is used. Isothermal amplification 

10 also includes rolling circle-based amplification (RCA). RCA is discussed in, e.g., Kool, 
U.S. Patent No. 5,714,320 and Lizardi, U.S. Patent No. 5,854,033; Hatch, et al, 1999. 
Genet. Anal. Biomol. Engineer. 15: 35-40. The result of the RCA is a single DNA strand 
extended from the 3' terminus of the anchor primer (and thus is linked to the solid 
support matrix) and including a concatamer containing multiple copies of the circular 

1 5 template annealed to a primer sequence. Typically, 1 ,000 to 1 0,000 or more copies of 
circular templates, each having a size of, e.g. y approximately 30-500, 50-200, or 60-100 
nucleotides size range, can be obtained with RCA. 

The product of RCA amplification following annealing of a circular nucleic acid 
molecule to an anchor primer is shown schematically in Figure 1 1 A. A circular template 

20 nucleic acid 102 is annealed to an anchor primer 104, which has been linked to a surface 
106 at its 5' end and has a free 3' OH available for extension. The circular template 
nucleic acid 102 includes two adapter regions 108 and 1 10 which are complementary to 
regions of sequence in the anchor primer 104. Also included in the circular template 
nucleic acid 102 is an insert 112 and a region 1 14 homologous to a sequencing primer, 

25 which is used in the sequencing reactions described below. 

Upon annealing, the free 3 , -OH on the anchor primer 104 can be extended using 
sequences within the template nucleic acid 102. The anchor primer 102 can be extended 
along the template multiple times, with each iteration adding to the sequence extended 
from the anchor primer a sequence complementary to the circular template nucleic acid. 

30 Four iterations, or four rounds of rolling circle replication, are shown in Figure 1 1 A as 
the extended anchor primer amplification product 1 14. Extension of the anchor primer 
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results in an amplification product covalently or otherwise physically attached to the 
substrate 106. A number of in vitro nucleic acid amplification techniques may be utilized 
to extend the anchor primer sequence. The amplification is typically performed in the 
presence of a polymerase, e.g., a DNA or RNA-directed DNA polymerase, and one, two, 
5 three, or four types of nucleotide triphosphates, and, optionally, auxiliary binding 

proteins. In general, any polymerase capable of extending a primed 3' -OH group can be 
used a long as it lacks a 3' to 5' exonuclease activity. Suitable polymerases include, e.g., 
the DNA polymerases from Bacillus stearothermophilus, TJiermus acquaticus, 
Pyrococcus furiosis, Thermococcus litoralis, and Tliermus thermophilus* bacteriophage 

10 T4 and T7, and the E. coli DNA polymerase I Klenow fragment. Suitable RNA-directed 
DNA polymerases include, e.g., the reverse transcriptase from the Avian Myeloblastosis 
Virus, the reverse transcriptase from the Moloney Murine Leukemia Virus, and the 
reverse transcriptase from the Human Immunodeficiency Virus-I. 

Additional embodiments of circular templates and anchor primers are shown in 

15 more detail in Figure 1 1B-1 ID. Figure 1 IB illustrates an annealed open circle linear 

substrate that can serve, upon ligation, as a template for extension of an anchor primer. A 
template molecule having the sequence 5' - TCg TgT gAg gTC TCA gCA TCT TAT 
gT A TAT TTA CTT CT A TTC TCA gTT gCC TAA gCT gC A gCC A-3' (SEQ ID 
NO:l) is annealed to an anchor primer having a biotin linker at its 5' terminus and the 

20 sequence 5>-gAC CTC ACA CgA Tgg CTg CAg CTT - 3' (SEQ ID NO:2). 

Annealing of the template results in juxtaposition of the 5' and 3' ends of the template 
molecule. The 3 'OH of the anchor primer can be extended using the circular template. 

The use of a circular template and an anchor primer for identification of single 
nucleotide polymorphisms is shown in Figure 1 1C. Shown is a generic anchor primer 

25 having the sequence 5' - gAC CTC ACA CgA Tgg CTg CAg CTT - 3 '(SEQ ID 

NO:3). The anchor primer anneals to an SNP probe having the sequence 5' - TTT ATA 
TgT ATT CTA CgA CTC Tgg AgT gTg CTA CCg ACg TCg AAt CCg TTg ACT CTT 
ATC TTC A-3' (SEQ ID NO:4). The SNP probe in turn hybridizes to a region of a 
SNP-containing region of a gene having the sequence 5' - CTA gCT CgT ACA TAT 

30 AAA TgA AgA TAA gAT CCT g - 3' (SEQ ID NO:5). Hybridization of a nucleic acid 
sequence containing the polymorphism to the SNP probe complex allows for subsequent 
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ligation and circularization of the SNP probe. The SNP probe is designed so that its 5' 
and 3' termini anneal to the genomic region so as to abut in the region of the polymorphic 
site, as is indicated in Figure 1 1C. The circularized SNP probe can be subsequently 
extended and sequenced using the methods described herein. A nucleic acid lacking the 
5 polymorphism does not hybridize so as to result in juxtaposition of the 5' and 3' termini 
of the SNP probe. In this case, the SNP probe cannot be ligated to form a circular 
substrate needed for subsequent extension. 

Figure 1 ID illustrates the use of a gap oligonucleotide to along with a circular 
template molecule. An anchor primer having the sequence 5'-gAC CTCACACgA 

10 gTA gCA Tgg CTg CAg CTT - T (SEQ ID NO:6) is attached to a surface through a 
biotin linker. A template molecule having the sequence 5' - TCg TgT gAg gTC TCA 
gCA TCT TAT gTA TAT TTA CTT CTA TTC TCA gTT gCC TAA gCT gCA gCC A - 
3' (SEQ ID NO:7) is annealed to the anchor primer to result in partially single stranded, 
or gapped region, in the anchor primer flanked by a double-stranded region. A gapping 

1 5 molecule having the sequence 5 ' - TgC TAC - 3' then anneals to the anchor primer. 
Ligation of both ends of the gap oligonucleotide to the template molecule results in 
formation of a circular nucleic acid molecule that can act as a template for rolling circle 
amplification. 

RCA can occur when the replication of the duplex molecule begins at the origin. 

20 Subsequently, a nick opens one of the strands, and the free 3 '-terminal hydroxyl moiety 
generated by the nick is extended by the action of DNA polymerase. The newly 
synthesized strand eventually displaces the original parental DNA strand. This 
aforementioned type of replication is known as rolling-circle replication (RCR) because 
the point of replication may be envisaged as "rolling around" the circular template strand 

25 and, theoretically, it could continue to do so indefinitely. Additionally, because the 

newly synthesized DNA strand is covalently-bound to the original template, the displaced 
strand possesses the original genomic sequence gene or other sequence of interest) 
at its S'-tenninus. In RCR, the original genomic sequence is followed by any number of 
"replication units" complementary to the original template sequence, wherein each 

30 replication unit is synthesized by continuing revolutions of said original template 
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sequence. Hence, each subsequent revolution displaces the DNA which is synthesized in 
the previous replication cycle. 

Through the use of the RCA reaction, a strand may be generated which represents 
many tandem copies of the complement to the circularized molecule. For example, RCA 
5 has recently been utilized to obtain an isothermal cascade amplification reaction of 
circularized padlock probes in vitro in order to detect single-copy genes in human 
genomic DNA samples (see Lizardi, et al, 1998. Nat. Genet 19: 225-232). In addition, 
RCA has also been utilized to detect single DNA molecules in a solid phase-based assay, 
although difficulties arose when this technique was applied to in situ hybridization (see 

10 Lizardi, etai, 1998. Nat. Genet. 19: 225-232). 

If desired, RCA can be performed at elevated temperatures, e.g., at temperatures 
greater than 37° C, 42° C, 45° C, 50° C, 60° C, or 70° C. In addition, RCA can be 
performed initially at a lower temperature, e.g., room temperature, and then shifted to an 
elevated temperature. Elevated temperature RCA is preferably performed with 

15 thermostable nucleic acid polymerases and with primers that can anneal stably and with 
specificity at elevated temperatures. 

RCA can also be performed with non-naturally occurring oligonucleotides, e.g., 
peptide nucleic acids. Further, RCA can be performed in the presence of auxiliary 
proteins such as single-stranded binding proteins. 

20 The development of a method of amplifying short DNA molecules which have 

been immobilized to a solid support, termed RCA has been recently described in the 
literature (see e.g., Hatch, et al., 1999. Genet. Anal. Biomol. Engineer. 15: 35-40; Zhang, 
etai, 1998.Ge«e211: 277-85; Baner, et al. \99Z.NucL Acids Res. 26: 5073-5078; Liu, 
etai, \995.J.Am. Chem.Soc. 118: 1587-1594; Fire and Xu, 1995. Proc. Natl Acad. 

IS Sci. USA 92: 4641-4645; Nilsson, et al, 1994. Science 265: 2085-2088). RCA targets 
specific DNA sequences through hybridization and a DNA ligase reaction. The circular 
product is then subsequently used as a template in a rolling circle replication reaction. 

Other examples of isothermal amplification systems include, e.g., (/) self- 
sustaining, sequence replication (see e.g., Guatelli, et al, 1990. Proc. Natl. Acad. Sci. 

30 USA 87: 1874-1878), (ii) the Q(5 replicase system (see e.g., Lizardi, et al., 1988. 
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BioTechnology 6: 1 197-1202), and (hi) nucleic acid sequence-based amplification 
(NASBA™;seeKievits,<?/a/., 1991.7. Virol. Methods 35: 273-286). 

PCR Amplification of Nucleic Acid Templates 

5 In a preferred embodiment, polymerase chain reaction ("PCR") is used to 

generate additional copies of the template nucleic acids. The PCR amplification step may 
be performed prior to distribution of the nucleic acid templates onto the picotiter plate or 
may be performed after the nucleic acid templates have been distributed onto the picotiter 
plate. 

1 0 Bead Emulsion PCR Amplification 

In a preferred embodiment, a PCR amplification step is performed prior to 
distribution of the nucleic acid templates onto the picotiter plate. 

In a particularly preferred embodiment, a novel amplification system, herein 
termed "bead emulsion amplification" is performed by attaching a template nucleic acid 
1 5 (e.g., DNA) to be amplified to a solid support, preferably in the form of a generally 
spherical bead. A library of single stranded template DNA prepared according to the 
sample prepration methods of this invention is an example of one suitable source of the 
starting nucleic acid template library to be attached to a bead for use in this amplification 
method. 

20 The bead is linked to a large number of a single primer species (/.&, primer B in 

Figure 6) that is complementary to a region of the template DNA. Template DNA 
annealed to the bead bound primer. The beads are suspended in aqueous reaction mixture 
and then encapsulated in a water-in-oil emulsion. The emulsion is composed of discrete 
aqueous phase microdroplets, approximately 60 to 200 um in diameter, enclosed by a 

25 thermostable oil phase. Each microdroplct contains, preferably, amplification reaction 
solution (i.e., the reagents necessary for nucleic acid amplification). An example of an 
amplification would be a PCR reaction mix (polymerase, salts, dNTPs) and a pair of PCR 
primers (primer A and primer B). See, Figure 6A. A subset of the microdroplet 
population also contains the DNA bead comprising the DNA template. This subset of 

30 microdroplet is the basis for the amplification. The microcapsules that are not within this 
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subset have no template DNA and will not participate in amplification. In one 
embodiment, the amplification technique is PCR and the PCR primers are present in a 8:1 
or 16:1 ratio (i.e., 8 or 16 of one primer to 1 of the second primer) to perform asymmetric 
PCR. 

5 In this overview, the DNA is annealed to an oligonucleotide (primer B) which is 

immobilized to a bead. During thermocycling (Figure 6B), the bond between the single 
stranded DNA template and the immobilized B primer on the bead is broken, releasing 
the template into the surrounding microencapsulated solution. The amplification 
solution, in this case, the PCR solution, contains addition solution phase primer A and 
10 primer B. Solution phase B primers readily bind to the complementary b' region of the 
template as binding kinetics are more rapid for solution phase primers than for 
immobilized primers. In early phase PCR, both A and B strands amplify equally well 
(Figure 6C). 

By midphase PCR (i.e., between cycles 10 and 30) the B primers are depleted, 
1 5 halting exponential amplification. The reaction then enters asymmetric amplification and 
the amplicon population becomes dominated by A strands (Figure 6D). In late phase 
PCR (Figure 6E), after 30 to 40 cycles, asymmetric amplification increases the 
concentration of A strands in solution. Excess A strands begin to anneal to bead 
immobilized B primers. Thermostable polymerases then utilize the A strand as a 
20 template to synthesize an immobilized, bead bound B strand of the amplicon. 

In final phase PCR (Figure 6F), continued thermal cycling forces additional 
annealing to bead bound primers. Solution phase amplification may be minimal at this 
stage but concentration of immobilized B strands increase. Then, the emulsion is broken 
and the immobilized product is rendered single stranded by denaturing (by heat, pH etc.) 
25 which removes the complimentary A strand. The A primers are annealed to the A' region 
of immobilized strand, and immobilized strand is loaded with sequencing enzymes, and 
any necessary accessory proteins. The beads are then sequenced using recognized 
pyrophosphate techniques (described, e.g., in US patent 6,274,320, 6258,568 and 
6,210,891, incorporated in toto herein by reference). 

30 
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Template design 

In a preferred embodiment, the DNA template to be amplified by bead emulsion 
amplification can be a population of DNA such as, for example, a genomic DNA library 
5 or a cDNA library. It is preferred that each member of the population have a common 
nucleic acid sequence at the first end and a common nucleic acid sequence at a second 
end. This can be accomplished, for example, by ligating a first adaptor DNA sequence to 
one end and a second adaptor DNA sequence to a second end of the DNA population. 
Many DNA and cDNA libraries, by nature of the cloning vector (e.g., Bluescript, 

10 Stratagene, La Jolla, CA) fit this description of having a common sequence at a first end 
and a second common sequence at a second end of each member DNA. The DNA 
template may be of any size amenable to in vitro amplification (including the preferred 
amplification techniques of PCR and asymmetric PCR). In a preferred embodiment, the 
DNA template is between about 150 to 750 bp in size, such as, for example about 250 bp 

15 in size. 

Binding Nucleic Acid Template to Capture Beads 

In a first step, a single stranded nucleic acid template to be amplified is attached 
to a capture bead. The nucleic acid template may be attached to the solid support capture 

20 bead in any manner known in the art. Numerous methods exist in the art for attaching 
DNA to a solid support such as the preferred microscopic bead. According to the present 
invention, covalent chemical attachment of the DNA to the bead can be accomplished by 
using standard coupling agents, such as water-soluble carbodiimide, to link the 5'- 
phosphate on the DNA to amine-coated capture beads through a phosphoamidate bond. 

25 Another alternative is to first couple specific oligonucleotide linkers to the bead using 
similar chemistry, and to then use DNA ligase to link the DNA to the linker on the bead. 
Other linkage chemistries to join the oligonucleotide to the beads include the use of N- 
hydroxysuccinamidc (NHS) and its derivatives. In such a method, one end of the 
oligonucleotide may contain a reactive group (such as an amide group) which forms a 

30 covalent bond with the solid support, while the other end of the linker contains a second 
reactive group that can bond with the oligonucleotide to be immobilized. In a preferred 
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embodiment, the oligonucleotide is bound to the DNA capture bead by covalent linkage. 
However, non-covalent linkages, such as chelation or antigen-antibody complexes, may 
also be used to join the oligonucleotide to the bead. 

Oligonucleotide linkers can be employed which specifically hybridize to unique 
5 sequences at the end of the DNA fragment, such as the overlapping end from a restriction 
enzyme site or the "sticky ends" of bacteriophage lambda based cloning vectors, but 
blunt-end ligations can also be used beneficially. These methods are described in detail 
in US 5,674,743. It is preferred that any method used to immobilize the beads will 
continue to bind the immobilized oligonucleotide throughout the steps in the methods of 
10 the invention. 

In one embodiment, each capture bead is designed to have a plurality of nucleic 
acid primers that recognize (i.e., are complementary to) a portion of the nucleic template, 
and the nucleic acid template is thus hybridized to the capture bead. In the methods 
described herein, clonal amplification of the template species is desired, so it is preferred 

1 5 that only one unique nucleic acid template is attached to any one capture bead. 

The beads used herein maybe of any convenient size and fabricated from any 
number of known materials. Example of such materials include: inorganics, natural 
polymers, and synthetic polymers. Specific examples of these materials include: 
cellulose, cellulose derivatives, acrylic resins, glass, silica gels, polystyrene, gelatin, 

20 polyvinyl pyrrolidone, co-polymers of vinyl and acrylamide, polystyrene cross-linked 
with divinylbenzene or the like (as described, e.g., in Merrifield, Biochemistry 1964, 3, 
1385-1390), polyacrylamides, latex gels, polystyrene, dextran, rubber, silicon, plastics, 
nitrocellulose, natural sponges, silica gels, control pore glass, metals, cross-linked 
dextrans (e.g., Sephadex™) agarose gel (Sepharose™), and solid phase supports known 

25 to those of skill in the art. In a preferred embodiment, the capture beads are Sepharose 
beads approximately 25 to 40 \im in diameter. 



Emulsification 

Capture beads with attached single strand template nucleic acid are emulsified as 
30 a heat stable water-in-oil emulsion. The emulsion may be formed according to any 

suitable method known in the art. One method of creating emulsion is described below 
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but any method for making an emulsion may be used. These methods are known in the 
art and include adjuvant methods, counterflow methods, crosscurrent methods, rotating 
drum methods, and membrane methods. Furthermore, the size of the microcapsules may 
be adjusted by varying the flow rate and speed of the components. For example, in 
5 dropwise addition, the size of the drops and the total time of delivery may be varied. 
Preferably, the emulsion contains a density of bead "microreactors" at a density of about 
3,000 beads per microliter. 

The emulsion is preferably generated by suspending the template-attached beads 
in amplification solution. As used herein, the term "amplification solution" means the 

10 sufficient mixture of reagents that is necessary to perform amplification of template 
DNA. One example of an amplification solution, a PCR amplification solution, is 
provided in the Examples below - it will be appreciated that various modifications may 
be made to the PCR solution. 

In one embodiment, the bead / amplification solution mixture is added dropwise 

15 into a spinning mixture of biocompatible oil (e.g., light mineral oil, Sigma) and allowed 
to emulsify. The oil used may be supplemented with one or more biocompatible 
emulsion stabilizers. These emulsion stabilizers may include Atlox 4912, Span 80, and 
other recognized and commercially available suitable stabilizers. Preferably, the droplets 
formed range in size from 5 micron to 500 microns, more preferably, from between about 

20 50 to 300 microns, and most preferably, from 100 to 150 microns. 

There is no limitation in the size of the microreactors. The microrcactors should 
be sufficiently large to encompass sufficient amplification reagents for the degree of 
amplification required. However, the microreactors should be sufficiently small so that a 
population of microreactors, each containing a member of a DNA library, can be 

25 amplified by conventional laboratory equipment (e.g., PCR thermocycling equipment, 
test tubes, incubators and the like). 

With the limitations described above, the optimal size of a microreaclor may be 
between 100 to 200 microns in diameter. Microreactors of this size would allow 
amplification of a DNA library comprising about 600,000 members in a suspension of 

30 microreactors of less than 10 ml in volume. For example, if PCR was the chosen 

amplification method, 10 mis would fit in 96 tubes of a regular thermocycler with 96 tube 
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capacity. In a preferred embodiment, the suspension of 600,000 microreactors would 
have a volume of less than 1 ml. A suspension of less than 1 ml may be amplified in 
about 10 tubes of a conventional PCR thermocycler. In a most preferred embodiment, 
the suspension of 600,000 microreactors would have a volume of less than 0.5 ml. 

5 

Amplification 

After encapsulation, the template nucleic acid may be amplified by any suitable 
method of DNA amplification including transcription-based amplification systems 
(Kwoh D. et al., Proc. Natl. Acad Sci. (U.S.A.) 86:1 173 (1989); Gingeras T. R. et al., 

10 PCT appl. WO 88/10315; Davey, C. et al., European Patent Application Publication No. 
329,822; Miller, H. I. et al., PCT appl. WO 89/06700, and "race" (Frohman, M. A., In: 
PCR Protocols: A Guide to Methods and Applications, Academic Press, NY (1990)) and 
"one-sided PCR" (Ohara, O. et al., Proc. Natl. Acad. Sci. (U.S.A.) 86.5673-5677 (1989)). 
Still other less common methods such as "di-oligonucleotide" amplification, isothermal 

15 amplification (Walker, G. T. et al., Proc. Natl. Acad. Sci. (U.S.A.) 89:392-396 (1992)), 
and rolling circle amplification (reviewed in 5,714,320), may be used in the present 
invention. 

In a preferred embodiment, DNA amplification is performed by PCR. PCR 
according to the present invention may be performed by encapsulating the target nucleic 

20 acid, bound to a bead, with a PCR solution comprising all the necessary reagents for 
PCR. Then, PCR may be accomplished by exposing the emulsion to any suitable 
thermocycling regimen known in the art. In a preferred embodiment, between 30 and 50 
cycles, preferably about 40 cycles, of amplification are performed. It is desirable, but not 
necessary, that following the amplification procedure there be one or more hybridization 

25 and extension cycles following the cycles of amplification. In a preferred embodiment, 
between 10 and 30 cycles, preferably about 25 cycles, of hybridization and extension are 
performed (e.g., as described in the examples). Routinely, the template DNA is amplified 
until typically at least two million to fifty million copies, preferably about ten million to 
thirty million copies of the template DNA are immobilized per bead. 

30 
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Breaking the Emulsion and Bead Recovery 

Following amplification of the template, the emulsion is "broken" (also referred 
to as "demulsification" in the art). There are many methods of breaking an emulsion 
5 (see, e.g., U.S. Patent 5,989,892 and references cited therein) and one of skill in the art 
would be able to select the proper method. In the present invention, one preferred 
method of breaking the emulsion is to add additional oil to cause the emulsion to separate 
into two phases. The oil phase is then removed, and a suitable organic solvent (e.g., 
hexanes) is added. After mixing, the oil/organic solvent phase is removed. This step 
10 may be repeated several times. Finally, the aqueous layers above the beads are removed. 
The beads are then washed with an organic solvent / annealing buffer mixture (e.g., one 
suitable annealing buffer is described in the examples), and then washed again in 
annealing buffer. Suitable organic solvents include alcohols such as methanol, ethanol 
and the like. 

1 5 The amplified template-containing beads may then be resuspended in aqueous 

solution for use, for example, in a sequencing reaction according to known technologies. 
(See, Sanger, F. et al, Proc. Natl. Acad. Sci. U.S.A. 75, 5463-5467 (1977); Maxam, A. 
M. & Gilbert, W. Proc Natl Acad Sci USA 74, 560-564 (1977); Ronaghi, M. et ah, 
Science 281, 363, 365 (1998); Lysov, I. et al., Dokl Akad Nauk SSSR 303, 1508-151 1 

20 (1988); Bains W. & Smith G. C. J.TheorBiol 135, 303-307(1988); Drnanac, R. et al., 
Genomics 4, 1 14-128 (1989); Khrapko, K. R. et al., FEBS Lett 256. 1 1 8-122 (1989); 
Pevzner P. A. J Biomol Struct Dyn 7, 63-73 (1989); Southern, E. M. et al., Genomics 13, 
1008-1017 (1992).) If the beads are to be used in a pyrophosphale-based sequencing 
reaction (described, e.g., in US patent 6,274,320, 6258,568 and 6,210,891, and 

25 incorporated in toto herein by reference), then it is necessary to remove the second strand 
of the PCR product and anneal a sequencing primer to the single stranded template that is 
bound to the bead. 

Briefly, the second strand is melted away using any number of commonly known 
methods such as NaOH, low ionic (e.g., salt) strength, or heat processing. Following this 
30 melting step, the beads are pelleted and the supernatant is discarded. The beads arc 
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resuspended in an annealing buffer, the sequencing primer added, and annealed to the 
bead-attached single stranded template using a standard annealing cycle. 

Purifying the beads 

5 At this point, the amplified DNA on the bead may be sequenced either directly on 

the bead or in a different reaction vessel, hi an embodiment of the present invention, the 
DNA is sequenced directly on the bead by transferring the bead to a reaction vessel and 
subjecting the DNA to a sequencing reaction (e.g., pyrophosphate or Sanger sequencing). 
Alternatively, the beads may be isolated and the DNA may be removed from each bead 

10 and sequenced. In either case, the sequencing steps may be performed on each individual 
bead. However, this method, while commercially viable and technically feasible, may 
not be most effective because many of the beads will be negative beads (a bead that does 
not have amplified DNA attached). Accordingly, the following optional process may be 
used for removing beads that contain no nucleic acid template prior to distribution onto 

15 the picotiter plate. 

A high percentage of the beads may be "negative" (i.e., have no amplified nucleic 
acid template attached thereto) if the goal of the initial DNA attachment is to minimize 
beads with two different copies of DNA. For useful pyrophosphate sequencing, each 
bead should contain multiple copies of a single species of DNA. This requirement is 

20 most closely approached by maximizing the total number of beads with a single fragment 
of DNA bound (before amplification). This goal can be achieved by the observation of a 
mathematical model. 

For the general case of "N" number of DNAs randomly distributed among M 
number of beads, the relative bead population containing any number of DNAs depends 

25 on the ratio of N/M. The fraction of beads containing N DNAs R(N) may be calculated 
using the Poisson distribution: 

R(N) = exp-(N/M) X (N/M) N /N! (where X is the multiplication symbol) 
The table below shows some calculated values for various N/M (the average DNA 
fragment to bead ratio) and N (the number of fragments actually bound to a bead). 

30 
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N/M 


0.1 


0.5 


1 


2 


R(0) 


0.9 


0.61 


0.37 


0.13 


R(D 


0.09 


0.3 


0.37 


0.27 


R(N>1) 


0.005 


0.09 


0.26 


0.59 



. In the table the top row denotes the various ratios of N/M. R(0) denotes the 
fraction of beads with no DNA, R(l) denotes the fraction of beads with one DNA 
5 attached (before amplification) and R(N>1) denotes the fraction of DNA with more than 
one DNA attached (before amplification). 

The table indicates that the maximum fraction of beads containing a single DNA 
fragment is 0.37 (37%) and occurs at a fragment to bead ratio of one. In this mixture, 
about 63% of the beads is useless for sequencing because they have either no DNA or 

1 0 more than a single species of DNA. Additionally, controlling the fragment to bead ratio 
require complex calculations and variability could produce bead batches with a 
significantly smaller fraction of useable beads. 

This inefficiency could be significantly ameliorated if beads containing amplicon 
(originating from the binding of at least one fragment) could be separated from those 

15 without amplicon (originating from beads with no bound fragments). An amplicon is 
defined as any nucleic acid molecules produced by an in vitro nucleic amplification 
technique. Binding would be done at low average fragment-to-bead ratios (N/M <1), 
minimizing the ratio of beads with more than one DNA bound. A separation step would 
remove most or all of the beads with no DNA leaving an enriched population of beads 

20 with one species of amplified DNA. These beads may be applied to any method of 
sequencing such as, for example, pyrophosphate sequencing. Because the fraction of 
beads with one amplicon (N=l) has been enriched, any method of sequencing would be 
more efficient. 

As an example, with an average fragment to bead ratio of 0.1 , 90% of the beads 
25 will have no amplicon, 9% of the beads would be useful with one amplicon, and 0.5% of 
the beads will have more than one amplicon. An enrichment process of the invention will 
remove the 90% of the zero amplicon beads leaving a population of beads where the 
sequenceable fraction (N=l) is: 
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1-(0.005/0.09)=94%. 

Dilution of the fragment to bead mixture, along with separation of beads 
containing amplicon can yield an enrichment of 2.5 folds over the optimal unenriched 
method. 94%/37% (See table above N/M=l) = 2.5. An additional benefit of the 
5 enrichment procedure of the invention is that the ultimate fraction of sequenceable beads 
is relatively insensitive to. variability in N/M. Thus, complex calculations to derive the 
optimal N/M ratio are either unnecessary or may be performed to a lower level of 
precision. This will ultimately make the procedure more suitable to performance by less 
trained personnel or automation. An additional benefit of the procedure is that the zero 

10 amplicon beads may be recycled and reused. While recycling is not necessary, it may 
reduce cost or the total bulk of reagents making the method of the invention more 
suitable for some purposes such as, for example, portable sampling, remote robotic 
sampling and the like. In addition, all the benefits of the procedure (i.e., less trained 
personnel, automation, recycling of reagents) will reduce the cost of the procedure. The 

1 5 procedure is described in more detail below. 

The enrichment procedure may be used to treat beads that have been amplified in 
the bead emulsion method above. The amplification is designed so that each amplified 
molecule contains the same DNA sequence at its 3' end. The nucleotide sequence may 
be a 20 mer but may be any sequence from 15 bases or more such as 25 bases, 30 bases, 

20 35 bases, or 40 bases or longer. Naturally, while longer oligonucleotide ends are 

functional, they are not necessary. This DNA sequence may be introduced at the end of 
an amplified DNA by one of skill in the art. For example, if PCR is used for 
amplification of the DNA, the sequence may be part of one member of the PCR primer 
pair. 

25 A schematic of the enrichment process is illustrated in Figure 7. Here, the 

amplicon-bound bead mixed with 4 empty beads represents the fragment-diluted 
amplification bead mixture. In step 1 , a biotinylated primer complementary to the 3 ? end 
of the amplicon is annealed to the amplicon. In step 2, DNA polymerase and the four 
natural deoxynucleotides triphosphates (dNTPs) are added to the bead mix and the 

30 biotinylated primer is extended. This extension is to enhance the bonding between the 
biotinylated primer and the bead-bound DNA. This step may be omitted if the 



35 



EM: EV262659147US 



21465-514 



biotinylated primer - DNA bond is strong (e.g., in a high ionic environment). In Step 3, 
streptavidin coated beads susceptible to attraction by a magnetic field (referred to herein 
as "magnetic streptavidin beads") are introduced to the bead mixtures. Magnetic beads 
are commercially available, for example, from Dynal (M290). The streptavidin capture 
5 moieties binds biotins hybridized to the amplicons, which then specifically fix the 
amplicon-bound beads to the magnetic streptavidin beads. 

In step 5, a magnetic field (represented by a magnet) is applied near the reaction 
mixture, which causes all the "magnetic streptavidin beads/amplicon bound bead 
complexes" to be positioned along one side of the tube most proximal to the magnetic 

10 field. Magnetic beads without amplicon bound beads attached are also expected to be 
positioned along the same side. Beads without amplicons remain in solution. The bead 
mixture is washed and the beads not immobilized by the magnet (i.e., the empty beads) 
are removed and discarded. In step 6, the extended biotinylated primer strand is 
separated from the amplicon strand by "melting" - a step that can be accomplished, for 

15 example, by heat or a change in pH. The heat may be 60°C in low salt conditions (e.g., in 
a low ionic environment such as 0. IX SSC). The change in pH may be accomplished by 
the addition of NaOH. The mixture is then washed and the supernatant, containing the 
amplicon bound beads, is recovered while the now unbound magnetic beads are retained 
by a magnetic field. The resultant enriched beads may be used for DNA sequencing. It 

20 is noted that the primer on the DNA capture bead may be the same as the primer of step 2 
above. In this case, annealing of the amplicon-primer complementary strands (with or 
without extension) is the source of target-capture affinity. 

The biotin streptavidin pair could be replaced by a variety of capture-target pairs. 
Two categories arc pairs whose binding can be subsequently cleaved and those which 

25 bind irreversibly, under conditions that are practically achievable. Cleavable pairs 
include thiol-thiol, Digoxigenin/ anti- Digoxigenin, -Captavidin™ if cleavage of the 
target-capture complex is desired. 

As described above, step 2 is optional. If step 2 is omitted, it may not be 
necessary to separate the magnetic beads from the amplicon bound beads. The amplicon 

30 bound beads, with the magnetic beads attached, may be used directly for sequencing. If 
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the sequencing were to be performed in a microwell, separation would not be necessary if 
the amplicon bound bead-magnetic bead complex can fit inside the microwell. 

While the use of magnetic capture beads is convenient, capture moieties can be 
bound to other surfaces. For example, streptavidin could be chemically bound to a 
5 surface, such as, the inner surface of a tube. In this case, the amplified bead mixture may 
be flowed through. The amplicon bound beads will tend to be retained until "melting" 
while the empty beads will flow through. This arrangement may be particularly 
advantageous for automating the bead preparation process. 

While the embodiments described above is particularly useful, other methods can 

10 be envisioned to separate beads. For example, the capture beads may be labeled with a 
fluorescent moiety which would make the target-capture bead complex fluorescent. The 
target capture bead complex may be separated by flow cytometry or fluorescence cell 
sorter. Using large capture beads would allow separation by filtering or other particle 
size separation techniques. Since both capture and target beads are capable of forming 

15 complexes with a number of other beads, it is possible to agglutinate a mass of cross- 
linked capture-target beads. The large size of the agglutinated mass would make 
separation possible by simply washing away the unagglutinated empty beads. The 
methods described are described in more detail, for example, in Bauer, J.; J. 
Chromatography B, 722 (1999) 55-69 and in Brody et al., Applied Physics Lett. 74 

20 (1999) 144-146. 

The DNA capture beads each containing multiple copies of a single species of 
nucleic acid template prepared according to the above method are then suitable for 
distribution onto the picotiter plate. 



25 Nucleic Acid Amplification On The Picotiter Plate 

In an alternative embodiment, the nucleic acid template is distributed onto the 
picotiter plate prior to amplification and then amplified in situ on the picotiter plate. This 
method is described in detail in the Examples. 
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3. SEQUENCING THE NUCLEIC ACID TEMPLATE 

Pyrophosphate sequencing is used according to the methods of this invention to 
5 sequence the nucleic acid template. This technique is based on the detection of released 
pyrophosphate (Ppi) during DNA synthesis. See, e.g., Hyman, 1988. A new method of 
sequencing DNA. Anal Biochem. 174:423-36; Ronaghi, 2001. Pyrosequencing sheds 
light on DNA sequencing. Genome Res. 1 1:3-1 1. 

In a cascade of enzymatic reactions, visible light is generated proportional to the 

10 number of incorporated nucleotides. The cascade starts with a nucleic acid 

polymerization reaction in which inorganic Ppi is released with nucleotide incorporation 
by polymerase. The released Ppi is converted to ATP by ATP sulfurylase, which 
provides the energy to luciferase to oxidize luciferin and generates light. Because the 
added nucleotide is known, the sequence of the template can be determined. Solid-phase 

15 pyrophosphate sequencing utilizes immobilized DNA in a three-enzyme system (see 
Figures). To increase the signal-to-noise ratio, the natural dATP has been replaced by 
dATPaS. Typically dATPaS is a mixture of two isomers (Sp and Rp); the use of pure 2*- 
deoxyadenosine-S-O- (1-thiotriphosphate) Sp-isomer in pyrophosphate sequencing 
allows substantially longer reads, up to doubling of the read length. 

20 4. Apparatus for Sequencing Nucleic Acids 

This invention provides an apparatus for sequencing nucleic acids, which 
generally comprises one or more reaction chambers for conducting a sequencing reaction, 
means for delivering reactants to and from the reaction chamber(s), and means for 
detecting a sequencing reaction event. In another embodiment, the apparatus includes a 

25 reagent delivery cuvette containing a plurality of cavities on a planar surface. In a 

preferred embodiment, the apparatus is connected to at least one computer for controlling 
the individual components of the apparatus and for storing and/or analyzing the 
information obtained from detection of the sequence reaction event. 

The invention also provides one or more reaction chambers are arranged on an 

30 inert substrate material, also referred to herein as a "solid support", that allows for 
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discrete localization of the nucleic acid template and of the reactants in a sequencing 
reaction in a defined space, as well as for detection of the sequencing reaction event. 
Thus, as used herein, the terms "reaction chamber" or "analyte reaction chamber" refer to 
a localized area on the substrate material that facilitates interaction of reactants, e.g., in a 
5 nucleic acid sequencing reaction. As discussed more fully below, the sequencing 

reactions contemplated by the invention preferably occur on numerous individual nucleic 
acid samples in tandem, in particular simultaneously sequencing numerous nucleic acid 
samples derived from genomic and chromosomal nucleic acid templates (e.g., DNA). 

The apparatus of the invention therefore preferably comprises a sufficient number 

10 of reaction chambers to carry out such numerous individual sequencing reactions. In one 
embodiment, there are at least 10,000 reaction chambers, preferably at least 50,000 
reaction chambers, more preferably greater than 100,000 reaction chambers, even more 
preferably greater than 200,000 reaction chambers. 

Since the number of simultaneous sequencing reactions is limited by the number 

15 of reaction chambers, the throughput may be increased by fabricating plates containing 
increasing densities of wells. Table 1 shows this progression for a 14 x 43 mm and 30 x 
60 mm active areas, derived from 25 x 75 mm and 40 x 75 mm arrays, respectively. 



20 



25 



Table 1. Development of higher well count arrays. 



Pitch 


Well 


# of Wells 


# of Wells 


(um) 


Diameter (urn) 


(14 x 43 mm) 


(30 x 60 mm) 


50 


44 


275K 


800K 


43 


38 


375K 


1.2M 


35 


31 


575K 


1.6M 


25 


22 


1.1M 


3.2M 



The reaction chambers on the array typically take the form of a cavity or well in 
the substrate material, having a width and depth, into which reactants can be deposited. 
30 Typically the nucleic acid template is distributed into the reaction chamber on one or 
more solid supports or beads; the reactants are in a medium which facilitates the reaction 
and which flows through the reaction chamber. When formed as cavities or wells, the 
chambers are preferably of sufficient dimension and order to allow for (i) the introduction 
of the necessary reactants into the chambers, (ii) reactions to take place within the 
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chamber and (iii) inhibition of mixing of reactants between chambers. The shape of the 
well or cavity is preferably circular or cylindrical, but can be multisided so as to 
approximate a circular or cylindrical shape. In one preferred embodiment, the shape of 
the well or cavity is substantially hexagonal. The cavity can have a smooth wall surface. 
5 In an additional embodiment, the cavity can have at least one irregular wall surface. The 
cavities can have a planar bottom or a concave bottom. 

The reaction chambers can be spaced between 5\im and 200|am apart. Spacing is 
determined by measuring the center- to-center distance between two adjacent reaction 
chambers. Typically, the reaction chambers can be spaced between lOfim and I50[im 

1 0 apart, preferably between 20^im and 1 00\im apart, most preferably between 40 and 60 urn 
apart. In one embodiment, the reaction chambers have a width (diameter) in one 
dimension of between 0.3jim and 100|.im, more preferably between 20^m and 70jim and 
most preferably about 30 and 50 |im. The depth of the reaction chambers are preferably 
between lOfim and lOOfxm, preferably between 20 ^im and 60^m. Alternatively, the 

1 5 reaction chambers may have a depth that is between 0.25 and 5 times the width in one 
dimension of the reaction chamber or, in another embodiment, between 0.3 and 1 times 
the width in one dimension of the reaction chamber. 

In a preferred embodiment, the array is fashioned from a sliced fiber optic bundle 
(i.e., a bundle of fused fiber optic cables) and the reaction chambers are formed by 

20 etching one surface of the fiber optic reactor array. The cavities can also be formed in the 
substrate via etching, molding or micromachining. 

Each cavity or reaction chamber typically has a depth of between 10 |im and 100 
|im; alternatively, the depth is between 0.25 and 5 times the size of the width of the 
cavity, preferably between 0.3 and 1 times the size of the width of the cavity. 

25 In one embodiment, the arrays described herein typically include a planar top 

surface and a planar bottom surface, which is optically conductive such that optical 
signals from the reaction chambers can be detected through the bottom planar surface. In 
these arrays, typically the distance between the top surface and the bottom surface is no 
greater than 10 cm, preferably no greater than 2 cm, and usually between 0.5 mm to 

30 5 mm, most preferably about 2 mm. 
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In a particularly preferred embodiment, the solid support is termed a 
picotiterplate, with reaction chambers having a center to center spacing of about 43 
to 50 p,m, a well diameter of between 38 |im to 44 jam, and a well volume of between 
10 to 150 pL, preferably between 20 to 90 pL, more preferably between 40 to 85 pL, and 
5 most preferably about 75 pL. 

In one embodiment, each cavity or reaction chamber of the array contains 
reagents for analyzing a nucleic acid or protein. Typically those reaction chambers that 
contain a nucleic acid (not all reaction chambers in the array are required to) contain only 
a single species of nucleic acid (i.e., a single sequence that is of interest). There may be a 

1 0 single copy of this species of nucleic acid in any particular reaction chamber, or there 
may be multiple copies. It is generally preferred that a reaction chamber contain at least 
100,000 copies of the nucleic acid template sequence, preferably at least 1,000,000 
copies, and more preferably between 2,000,000 to 20,000,000 copies, and most 
preferably between 5,000,000 to 15,000,000 copies of the nucleic acid. The ordinarily 

1 5 skilled artisan will appreciate that changes in the number of copies of a nucleic acid 
species in any one reaction chamber will affect the number of photons generated in a 
pyrosequencing reaction, and can be routinely adjusted to provide more or less photon 
signal as is required. In one embodiment the nucleic acid species is amplified to provide 
the desired number of copies using PCR, RCA, ligase chain reaction, other isothermal 

20 amplification, or other conventional means of nucleic acid amplification. In one 
embodiment, the nucleic acid is single stranded. 

Solid Support Material 

Any material can be used as the solid support material, as long as the surface 
allows for stable attachment of the primers and detection of nucleic acid sequences. The 
25 solid support material can be planar or can be cavitated, e.g., in a cavitated terminus of a 
fiberoptic or in a microwell etched, molded, or otherwise micromachined into the planar 
surface, e.g. using techniques commonly used in the construction of 
microelectromechanical systems. See e.g., Rai-Choudhury, Handbook OF 

MlCROLITHOGRAPHY, MlCROMACHINING, AND MlCROFABRlCATlON, VOLUME I: 

30 MlCROLITHOGRAPHY, Volume PM39, SPIE Press (1997); Madou, CRC Press (1997), 
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Aoki, Biotech. Histochem. 67: 98-9 (1992); Kane et a/., Biomaterials. 20: 2363-76 

(1999) ; Deng et al., Anal. Chenu 72:3 1 76-80 (2000); Zhu et a/., Nat. Genet. 26:283-9 

(2000) . In some embodiments, the solid support is optically transparent, e.g., glass. 
An array of attachment sites on an optically transparent solid support can be 

5 constructed using lithographic techniques commonly used in the construction of 

electronic integrated circuits as described in, e.g., techniques for attachment described in 
U.S. Patent Nos. 5,143,854, 5,445,934, 5,744,305, and 5,800,992; Chee et «/., Science 
274: 610-614 (1996); Fodor et a/., Nature 364: 555-556 (1993); Fodor et al, Science 
251: 767-773 (1991); Gushin,*f al, Anal Biochem. 250: 203-211 (1997); Kinositae/ 

10 al, Ce//93: 21-24(1998); Kato-Yamada et al, J. Biol. Chenu 273: 19375-19377 
(1998); and Yasuda et al, Cell 93: 1 1 17-1 124 (1998). Photolithography and electron 
beam lithography sensitize the solid support or substrate with a linking group that allows 
attachment of a modified biomolecule {e.g., proteins or nucleic acids). See e.g., Service, 
Science 283: 27-28 (1999); Rai-Choudhury, HANDBOOK OF Microlithography, 

1 5 MlCROMACHINING, AND MICRO FABRICATION, VOLUME I: MlCROLITHOGRAPHY, Volume 
PM39, SPIE Press (1997). Alternatively, an array of sensitized sites can be generated 
using thin-film technology as described in Zasadzinski et al., Science 263 : 1 726- 1 733 
(1994). 

The substrate material is preferably made of a material that facilitates detection of 
20 the reaction event. For example, in a typical sequencing reaction, binding of a dNTP to a 
sample nucleic acid to be sequenced can be monitored by detection of photons generated 
by enzyme action on phosphate liberated in the sequencing reaction. Thus, having the 
substrate material made of a transparent or light conductive material facilitates detection 
of the photons. 

25 In some embodiments, the solid support can be coupled to a bundle of optical 

fibers that are used to detect and transmit the light product. The total number of optical 
fibers within the bundle may be varied so as to match the number of individual reaction 
chambers in the array utilized in the sequencing reaction. The number of optical fibers 
incorporated into the bundle is designed to match the resolution of a detection device so 

30 as to allow 1:1 imaging. The overall sizes of the bundles are chosen so as to optimize the 
usable area of the detection device while maintaining desirable reagent (flow) 



42 



E/M: EV262659I47US 



21465-514 



characteristics in the reaction chamber. Thus, for a 4096 x 4096 pixel CCD (charge- 
coupled device) array with 15 |im pixels, the fiber bundle is chosen to be approximately 
60 mm x 60 mm or to have a diameter of approximately 90 mm. The desired number of 
optical fibers are initially fiised into a bundle or optical fiber array, the terminus of which 
5 can then be cut and polished so as to form a "wafer" of the required thickness (e.g., 

1.5 mm). The resulting optical fiber wafers possess similar handling properties to that of 
a plane of glass. The individual fibers can be any size diameter (e.g., 3^m to 100 urn). 

In some embodiments two fiber optic bundles are used: a first bundle is attached 
directly to the detection device (also referred to herein as the fiber bundle or connector) 

10 and a second bundle is used as the reaction chamber substrate (the wafer or substrate). In 
this case the two are placed in direct contact, optionally with the use of optical coupling 
fluid, in order to image the reaction centers onto the detection device. If a CCD is used 
as the detection device, the wafer could be slightly larger in order to maximize the use of 
the CCD area, or slightly smaller in order to match the format of a typical microscope 

15 slide — 25 mm x 75 mm. The diameters of the individual fibers within the bundles are 
chosen so as to maximize the probability that a single reaction will be imaged onto a 
single pixel in the detection device, within the constraints of the state of the art. 
Exemplary diameters are 6-8 ^m for the fiber bundle and 6-50 urn for the wafer, though 
any diameter in the range 3-1 00 fim can be used. Fiber bundles can be obtained 

20 commercially from CCD camera manufacturers. For example, the wafer can be obtained 
from Incom, Inc. (Charlton, MA) and cut and polished from a large fusion of fiber optics, 
typically being 2 mm thick, though possibly being 0.5 to 5 mm thick. The wafer has 
handling properties similar to a pane of glass or a glass microscope slide. 

Reaction chambers can be formed in the substrate made from fiber optic material. 

25 The surface of the optical fiber is cavitated by treating the termini of a bundle of fibers, 
e.g., with acid, to form an indentation in the fiber optic material. Thus, in one 
embodiment cavities are formed from a fiber optic bundle, preferably cavities can be 
formed by etching one end of the fiber optic bundle. Each cavitated surface can form a 
reaction chamber. Such arrays are referred to herein as fiber optic reactor arrays or 

30 FORA. The indentation ranges in depth from approximately one-half the diameter of an 
individual optical fiber up to two to three times the diameter of the fiber. Cavities can be 
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introduced into the termini of the fibers by placing one side of the optical fiber wafer into 
an acid bath for a variable amount of time. The amount of time can vary depending upon 
the overall depth of the reaction cavity desired (see e.g., Walt, et aL t 1996. Anal. Chem. 
70: 1888). A wide channel cavity can have uniform flow velocity dimensions of 
5 approximately 14mm x 43mm. Thus, with this approximate dimension and at 

approximately 4.82 x 10" 4 cavities/um 2 density, the apparatus can have approximately 
290,000 fluidically accessible cavities. Several methods are known in the art for 
attaching molecules (and detecting the attached molecules) in the cavities etched in the 
ends of fiber optic bundles. See, e.g., Michael, et al 9 Anal Chem. 70: 1242-1248 

10 (1998); Ferguson, et aL Nature Biotechnology 14: 1681-1684(1996); Healey and Walt, 
Anal Chem. 69: 2213-2216 (1997). A pattern of reactive sites can also be created in the 
raicrowell, using photolithographic techniques similar to those used in the generation of a 
pattern of reaction pads on a planar support. See, Healey, et al, Science 269: 1078-1080 
(1995); Munkholm and Walt, Anal Chem. 58: 1427-1430 (1986), and Bronk, et al, 

15 Anal Chem. 67: 2750-2757 (1995). 

The opposing side of the optical fiber wafer (i.e., the non-etched side) is typically 
highly polished so as to allow optical-coupling (e.g., by immersion oil or other optical 
coupling fluids) to a second, optical fiber bundle. This second optical fiber bundle 
exactly matches the diameter of the optical wafer containing the reaction chambers, and 

20 serve to act as a conduit for the transmission of light product to the attached detection 
device, such as a CCD imaging system or camera. 

In one preferred embodiment, the fiber optic wafer is thoroughly cleaned, e.g. by 
serial washes in 15% H 2 0 2 /1 5%NH 4 OH volume:volume in aqueous solution, then six 
deionized water rinses, then 0.5M EDTA, then six deionized water, then 15% 

25 H 2 0 2 /15%NH 4 OH, then six deionized water (one-half hour incubations in each wash). 

The surface of the fiber optic wafer is preferably coated to facilitate its use in the 
sequencing reactions. A coated surface is preferably optically transparent, allows for 
easy attachment of proteins and nucleic acids, and does not negatively affect the activity 
of immobilized proteins. In addition, the surface preferably minimizes non-specific 

30 absorption of macromolecules and increases the stability of linked macromolecules (e.g., 
attached nucleic acids and proteins). 
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Suitable materials for coating the array include, e.g., plastic (e.g. polystyrene). 
The plastic can be preferably spin-coated or sputtered (0. 1 fim thickness). Other 
materials for coating the array include gold layers, e.g. 24 karat gold, 0.1 |.im thickness, 
with adsorbed self-assembling monolayers of long chain thiol alkanes. Biotin is then 
5 coupled covalently to the surface and saturated with a biotin-binding protein (e.g. 
streptavidin or avidin). 

Coating materials can additionally include those systems used to attach an anchor 
primer to a substrate. Organosilane reagents, which allow for direct covalent coupling of 
proteins via amino, sulfhydryl or carboxyl groups, can also be used to coat the array. 
10 Additional coating substances include photoreactive linkers, e.g. photobiotin, (Amos et 
al., "Biomaterial Surface Modification Using Photochemical Coupling Technology," in 
Encyclopedic Handbook of Biomaterials and Bioengineering, Part A: Materials, Wise et 
al. (eds.), New York, Marcel Dekker, pp. 895926, 1995), 

Additional coating materials include hydrophilic polymer gels (polyacrylamide, 
15 polysaccharides), which preferably polymerize directly on the surface or polymer chains 
covalently attached post polymerization (Hjerten, y. Chromatogr. 347,191 (1985); 
Novotny, Anal Chenu 62,2478 (1990), as well as pluronic polymers (triblock 
copolymers, e.g. PPO-PEO-PPO, also known as F-108), specifically adsorbed to either 
polystyrene or silanized glass surfaces (Ho et al., Langmuir 14:3889-94, 1998), as well 
20 as passively adsorbed layers of biotin-binding proteins. The surface can also be coated 
with an epoxide which allows the coupling of reagents via an amine linkage. 

In addition, any of the above materials can be derivatized with one or more 
functional groups, commonly known in the art for the immobilization of enzymes and 
nucleotides, e.g. metal chelating groups (e.g. nitrilo triacetic acid, iminodiacetic acid, 
25 pentadentate chelalor), which will bind 6xHis-tagged proteins and nucleic acids. 

Surface coatings can be used that increase the number of available binding sites 
for subsequent treatments, e.g. attachment of enzymes (discussed later), beyond the 
theoretical binding capacity of a 2D surface. 

In a preferred embodiment, the individual optical fibers utilized to generate the 
30 fused optical fiber bundle/wafer arc larger in diameter (i.e., 6 |im to 12 \i\w) than those 
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utilized in the optical imaging system (i.e.. 3 (am). Thus, several of the optical imaging 
fibers can be utilized to image a single reaction site. 

In a particularly preferred embodiment, the sample cartridge for nucleic acid 
templete sequencing, termed the TicoTiter plate' is formed from a commercial fiber 
5 optics faceplate, acid-etched to yield well structures. Each optic fiber core is about 44 
microns in diameter, with a 2-3 micron cladding, each well formed by acid etching to 
form a reaction well volume of about 65 pL to 85 pL, most preferably about 75 pL. The 
use of etched wells on a fiber optics faceplate surface serves a threefold purpose; i) 
delayed diffusion of the luminescence from emitting light in a different region of the 
10 array, ii) isolation of reaction chambers ("test-tubes") that contain the amplified template 
molecules, and iii) very efficient, high numerical aperture optical coupling to the CCD. 
Finally, the larger the amount of sequencing template immobilized within a well, the 
more optical signal one is able to achieve. 

Delivery Means 

15 An example of the means for delivering reactants to the reaction chamber is the 

perfusion chamber of the present invention is illustrated in Figure 13. The perfusion 
chamber includes a sealed compartment with transparent upper and lower side. It is 
designed to allow flow of solution over the surface of the substrate surface and to allow 
for fast exchange of reagents. Thus, it is suitable for carrying out, for example, the 

20 pyrophosphate sequencing reactions. The shape and dimensions of the chamber can be 
adjusted to optimize reagent exchange to include bulk flow exchange, diffusive 
exchange, or both in cither a laminar flow or a turbulent flow regime. 

The perfusion chamber is preferably detached from the imaging system while it is 
being prepared and only placed on the imaging system when sequencing analysis is 

25 performed. In one embodiment, the solid support (i.e. a DNA chip or glass slide) is held 
in place by a metal or plastic housing, which may be assembled and disassembled to 
allow replacement of said solid support. The lower side of the solid support of the 
perfusion chamber carries the reaction chamber array and, with a traditional optical-based 
focal system, a high numerical aperture objective lens is used to focus the image of the 

30 reaction center array onto the CCD imaging system. 
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Many samples can thereby be analyzed in parallel. Using the method of the 
invention, many nucleic acid templates may be analyzed in this was by allowing the 
solution containing the enzymes and one nucleotide to flow over the surface and then 
detecting the signal produced for each sample. This procedure can then be repeated. 
5 Alternatively, several different oligonucleotides complementary to the template may be 
distributed over the surface followed by hybridization of the template. Incorporation of 
deoxynucleotides or dideoxynucleotides may be monitored for each oligonucleotide by 
the signal produced using the various oligonucleotides as primer. By combining the 
signals from different areas of the surface, sequence-based analyses may be performed by 

10 four cycles of polymerase reactions using the various dideoxynucleotides. 

When the support is in the form of a cavitated array, e.g., in the termini of a 
picotiter plate or other array of microwells, suitable delivery means for reagents include 
flowing and washing and also, e.g., flowing, spraying, electrospraying, ink jet delivery, 
stamping, ultrasonic atomization (Sonotek Corp., Milton, NY) and rolling. When 

1 5 spraying is used, reagents may be delivered to the picotiter plate in a homogeneous thin 
layer produced by industrial type spraying nozzles (Spraying Systems, Co., Wheaton, IL) 
or atomizers used in thin layer chromatography (TLC), such as CAMAG TLC Sprayer 
(Camag Scientific Inc., Wilmington, NC). These sprayers atomize reagents into aerosol 
spray particles in the size range of 0.3 to 10 [im. 

20 Successive reagent delivery steps are preferably separated by wash steps using 

techniques commonly known in the art. These washes can be performed, e.g., using the 
above described methods, including high-flow sprayers or by a liquid flow over the 
picotiter plate or microwell array surface. The washes can occur in any time period after 
the starting material has reacted with the reagent to form a product in each reaction 

25 chamber but before the reagent delivered to any one reaction chamber has diffused out of 
that reaction chamber into any other reaction chamber. In one embodiment, any one 
reaction chamber is- independent of the product formed in any other reaction chamber, but 
is generated using one or more common reagents. 

An embodiment of a complete apparatus is illustrated in Figure 12. The apparatus 

30 includes an inlet conduit 200 in communication with a detachable perfusion chamber 
226. The inlet conduit 200 allows for entry of sequencing reagents via a plurality of 
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tubes 202-212, which are each in communication with a plurality of sequencing 
dispensing reagent vessels 214-224. 

Reagents are introduced through the conduit 200 into the perfusion chamber 226 
using either a pressurized system or pumps to drive positive flow. Typically, the reagent 
5 flow rates are from 0.05 to 50 ml/minute (e.g., 1 to 50 ml/minute) with volumes from 
0.100 ml to continuous flow (for washing). Valves are under computer control to allow 
cycling of nucleotides and wash reagents. Sequencing reagents, e.g., polymerase can be 
either pre-mixed with nucleotides or added in stream. A manifold brings all six tubes 
202-212 together into one for feeding the perfusion chamber. Thus several reagent 
10 delivery ports allow access to the perfusion chamber. For example, one of the ports may 
be utilized to allow the input of the aqueous sequencing reagents, while another port 
allows these reagents (and any reaction products) to be withdrawn from the perfusion 
chamber. 

In a preferred embodiment, one or more reagents are delivered to an array 

15 immobilized or attached to a population of mobile solid supports, e.g., a bead or 

microsphere. The bead or microsphere need not be spherical, irregular shaped beads may 
be used. They are typically constructed from numerous substances, e.g., plastic, glass or 
ceramic and bead sizes ranging from nanometers to millimeters depending on the width 
of the reaction chamber. Various bead chemistries can be used e.g., methylslyrene, 

20 polystyrene, acrylic polymer, latex, paramagnetic, thoria sol, carbon graphite and 

titanium dioxide. The construction or chemistry of the bead can be chosen to facilitate 
the attachment of the desired reagent. 

In another embodiment, the bioactive agents are synthesized first, and then 
covalently attached to the beads. As is appreciated by someone skilled in the art, this will 

25 be done depending on the composition of the bioactive agents and the beads. The 
functionalization of solid support surfaces such as certain polymers with chemically 
reactive groups such as thiols, amines, carboxyls, etc. is generally known in the art. 

In a preferred embodiment, the nucleic acid template is delivered to the picotiter 
plate on beads. The luciferase and sulfurylase enzymes are likewise delivered to each 

30 well on beads (see Figure), as is the DNA polymerase. It is noted that the one or more of 
the nucleic acid template, luciferase and sulfurylase maybe delivered each on separate 
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beads, or together on the same bead. To allow sequencing DNA at raised temperatures, 
we have cloned and modified the thermostable sulfurylase from Bacillus 
steareothermophilus. We have also cloned and modified several luciferase enzymes for 
solid-phase enzyme activity, including P. pennsylvanica and P. pyralis. The P. pyralis 
5 luciferase is used in a preferred embodiment. 

"Blank" beads may be used that have surface chemistries that facilitate the 
attachment of the desired functionality by the user. Additional examples of these surface 
chemistries for blank beads include, but are not limited to, amino groups including 
aliphatic and aromatic amines, carboxylic acids, aldehydes, amides, chloromelhyl groups, 

1 0 hydrazide, hydroxy! groups, sulfonates and sulfates. 

These functional groups can be used to add any number of different candidate 
agents to the beads, generally using known chemistries. For example, candidate agents 
containing carbohydrates may be attached to an amino-functionalized support; the 
aldehyde of the carbohydrate is made using standard techniques, and then the aldehyde is 

15 reacted with an amino group on the surface. In an alternative embodiment, a sulfhydryl 
linker may be used. There are a number of sulfhydryl reactive linkers known in the art 
such as SPDP, maleimides, a-haloacetyls, and pyridyl disulfides (see for example the 
1994 Pierce Chemical Company catalog, technical section on cross-linkers, pages 155- 
200, incorporated here by reference) which can be used to attach cysteine containing 

20 proteinaceous agents to the support. Alternatively, an amino group on the candidate 

agent may be used for attachment to an amino group on the surface. For example, a large 
number of stable bifunctional groups are well known in the art, including 
homobi functional and heterobifunctional linkers (see Pierce Catalog and Handbook, 
pages 155-200). In an additional embodiment, carboxyl groups (either from the surface 

25 or from the candidate agent) may be derivatized using well known linkers (see Pierce 
catalog). For example, carbodiimides activate carboxyl groups for attack by good 
nucleophiles such as amines (see Torchilin et al., Critical Rev, Tfiereapeutic Drug 
Carrier Systems, 7(4):275-308 (1991 )). Proteinaceous candidate agents may also be 
attached using other techniques known in the art, for example for the attachment of 

30 antibodies to polymers; see Slinkin et al., Bioconj. Chem. 2:342-348 (1991); Torchilin et 
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al., supra; Trubetskoy ct aL, Bioconj. Client. 3:323-327 (1992); King et al., Cancer Res. 
54:6176-6185 (1994); and Wilbur et al., Bioconjugate Chem. 5:220-235 (1994). It should 
be understood that the candidate agents may be attached in a variety of ways, including 
those listed above. Preferably, the manner of attachment does not significantly alter the 
5 functionality of the candidate agent; that is, the candidate agent should be attached in 
such a flexible manner as to allow its interaction with a target. 

Specific techniques for immobilizing enzymes on beads are known in the prior 
art. In one case, NH 2 surface chemistry beads are used. Surface activation is achieved 
with a 2.5% glutaraldehyde in phosphate buffered saline (10 mM) providing a pH of 6.9 

10 (138 mM NaCl, 2.7 mM KC1). This mixture is stirred on a stir bed for approximately 2 
hours at room temperature. The beads are then rinsed with ultrapure water plus 0.01% 
Tween 20 (surfactant) -0.02%, and rinsed again with a pH 7.7 PBS plus 0.01% tween 20. 
Finally, the enzyme is added to the solution, preferably after being prefiltered using a 
0.45 urn amicon micropure filter. 

15 The population of mobile solid supports are disposed in the reaction chambers. In 

some embodiments, 5% to 20% of the reaction chambers can have a mobile solid support 
with at least one reagent immobilized thereon, 20% to 60% of the reaction chambers can 
have a mobile solid support with at least one reagent immobilized thereon or 50% to 
100% of the reaction chambers can have a mobile solid support with at least one reagent 

20 immobilized thereon. Preferably, at least one reaction chamber has a mobile solid 
support having at least one reagent immobilized thereon and the reagent is suitable for 
use in a nucleic acid sequencing reaction. 

In some embodiments, the reagent immobilized to the mobile solid support can be 
a polypeptide with sulfurylase activity, a polypeptide with luciferase activity or a 

25 chimeric polypeptide having both sulfurylase and luciferase activity. In one embodiment, 
it can be a ATP sulfurylase and luciferase fusion protein. Since the product of the 
sulfurylase reaction is consumed by luciferase, proximity between these two enzymes 
may be achieved by covalently linking the two enzymes in the form of a fusion protein. 
This invention would be useful not only in substrate channeling but also in reducing 

30 production costs and potentially doubling the number of binding sites on streptavidin- 
coated beads. 
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In another embodiment, the sulfurylase is a thermostable ATP sulfurylase. In a 
preferred embodiment, the thermostable sulfurylase is active at temperatures above 
ambient (to at least 50°C). In one embodiment, the ATP sulfurylase is from a 
thermophile. In an additional embodiment, the mobile solid support can have a first 
5 reagent and a second reagent immobilized thereon, the first reagent is a polypeptide with 
sulfurylase activity and the second reagent is a polypeptide with luciferase activity. 

In another embodiment, the reagent immobilized to the mobile solid support can 
be a nucleic acid; preferably the nucleic acid is a single stranded concatamer. In a 
preferred embodiment, the nucleic acid can be used for sequencing a nucleic acid, e.g., a 
1 0 pyrosequencing reaction. 

The invention also provides a method for detecting or quantifying ATP activity 
using a mobile solid support; preferably the ATP can be detected or quantified as part of 
a nucleic acid sequencing reaction. 

A picotiter plate that has been "carpeted" with mobile solid supports with either 
15 nucleic acid or reagent enzymes attached thereto is shown as Figure 1 5. 

3. Methods of Sequencing Nucleic Acids 

Pyrophosphate-based sequencing is then performed. The sample DNA sequence 
and the extension primer are then subjected to a polymerase reaction in the presence of a 
nucleotide triphosphate whereby the nucleotide triphosphate will only become 
20 incorporated and release pyrophosphate (PPi) if it is complementary to the base in the 
target position, the nucleotide triphosphate being added either to separate aliquots of 
sample-primer mixture or successively to the same sample-primer mixture. The release 
of PPi is then detected to indicate which nucleotide is incorporated. 

In one embodiment, a region of the sequence product is determined by annealing 
25 a sequencing primer to a region of the template nucleic acid, and then contacting the 
sequencing primer with a DNA polymerase and a known nucleotide triphosphate, /.e., 
dATP, dCTP, dGTP, dTTP, or an analog of one of these nucleotides. The sequence can 
be determined by detecting a sequence reaction byproduct, as is described below. 
The sequence primer can be any length or base composition, as long as it is 
30 capable of specifically annealing to a region of the amplified nucleic acid template. No 
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particular structure for the sequencing primer is required so long as it is able to 
specifically prime a region on the amplified template nucleic acid. Preferably, the 
sequencing primer is complementary to a region of the template that is between the 
sequence to be characterized and the sequence hybridizable to the anchor primer. The 
5 sequencing primer is extended with the DNA polymerase to form a sequence product. 
The extension is performed in the presence of one or more types of nucleotide 
triphosphates, and if desired, auxiliary binding proteins. 

Incorporation of the dNTP is preferably determined by assaying for the presence 
of a sequencing byproduct. In a preferred embodiment, the nucleotide sequence of the 

10 sequencing product is determined by measuring inorganic pyrophosphate (PPi) liberated 
from a nucleotide triphosphate (dNTP) as the dNMP is incorporated into an extended 
sequence primer. This method of sequencing, termed Pyrosequencing™ technology 
(PyroSequencing AB, Stockholm, Sweden) can be performed in solution (liquid phase) or 
as a solid phase technique. PPi-bascd sequencing methods are described generally in, 

15 e.g., W098 13523 Al, Ronaghi, et al. 1996. Anal. Biochem. 242: 84-89, Ronaghi, et a/., 
1998. Science 281: 363-365 (1998) and USSN 2001/0024790. These disclosures of PPi 
sequencing are incorporated herein in their entirety, by reference. See also , e.g., US 
patents 6,210,891 and 6,258,568, each fully incorporated herein by reference. 

Pyrophosphate released under these conditions can be detected enzymatically 

20 (e.g., by the generation of light in the luciferase-Iuciferin reaction). Such methods enable 
a nucleotide to be identified in a given target position, and the DNA to be sequenced 
simply and rapidly while avoiding the need for electrophoresis and the use of potentially 
dangerous radiolabels. 

PPi can be detected by a number of different methodologies, and various 

25 enzymatic methods have been previously described (see e.g.. Reeves, et al, 1969. Anal. 
Biochem. 28: 282-287; Guillory, et al, 1971. Anal Biochem. 39: 170-1 80; Johnson, et 
al., 1968. Anal. Biochem. 15: 273; Cook, et al, 1978. Anal Biochem. 91: 557-565;and 
Drake, et al, 1979. Anal Biochem. 94: 1 17-120). 

PPi liberated as a result of incorporation of a dNTP by a polymerase can be 

30 converted to ATP using, e.g., an ATP sulfurylase. This enzyme has been identified as 
being involved in sulfur metabolism. Sulfur, in both reduced and oxidized forms, is an 
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essential mineral nutrient for plant and animal growth (see e.g., Schmidt and Jager, 1992. 
Ann. Rev. Plant Physiol. Plant Mol Biol. 43: 325-349). In both plants and 
microorganisms, active uptake of sulfate is followed by reduction to sulfide. As sulfate 
has a very low oxidation/reduction potential relative to available cellular reductants, the 
5 primary step in assimilation requires its activation via an ATP-dependent reaction (see 
e.g., Leyh, 1993. CriL Rev. Biochem. Mol Biol 28: 515-542). ATP sulfurylasc (ATP: 
sulfate adenylyltransferase; EC 2.7.7.4) catalyzes the initial reaction in the metabolism of 
inorganic sulfate (SO/ 2 ); see e.g., Robbins and Lipmann, 1958. J. Biol Client. 233: 686- 
690; Hawes and Nicholas, 1973. Biochem. J. 133: 541-550). In this reaction SO/ 2 is 

10 activated to adenosine 5'-phosphosulfate (APS). 

ATP sulfurylase has been highly purified from several sources, such as 
Saccharomyces cerevisiae (see e.g., Hawes and Nicholas, 1973. Biochem. J. 133: 541- 
550); Penicillium chrysogenum (see e.g., Renosto, et al, 1990. Biol Chem. 265: 
10300-10308); rat liver (see e.g.. Yu, et al, 1989. Arch. Biochem. Biophys. 269: 165- 

15 174); and plants (see e.g., Shaw and Anderson, 1972. Biochem. J. 127: 237-247; 

Osslund, <?/<//.. 1982. Plant Physiol 70: 39-45). Furthermore, ATP sulfurylase genes 
have been cloned from prokaryotes (see e.g., Leyh, et al, 1992. J. Biol Chem. 267: 
10405-10410; Schwedock and Long, 1989. Mol Plant Microbe Interaction 2: 181-194; 
Laue and Nelson, 1994. J. Bacteriol 176: 3723-3729); eukaryotes (see e.g., Cherest, et 

20 al, 1987. Mol. Gen. Genet. 210: 307-313; Mountain and Korch, 1991. Yeast 7: 873-880; 
Foster, et al, 1994. J. Biol Chem. 269: 19777-19786); plants (see e.g., Leustek, et al, 
1994. Plant Physiol 105: 897-90216); and animals (see e.g., Li, et al, 1995. J. Biol 
Chem. 270: 29453-29459). The enzyme is a homo-oligomer or heterodimer, depending 
upon the specific source (see e.g., Leyh and Suo, 1992. J. Biol Chem. 267: 542-545). 

25 In some embodiments, a thermostable sulfurylase is used. Thermostable 

sulfurylases can be obtained from, e.g., Archaeoglobus or Pyrococcus spp. Sequences of 
thermostable sulfurylases are available at database Acc. No. 028606, Acc. No. Q9YCR4, 
and Acc. No. P56863. 

ATP sulfurylase has been used for many different applications, for example, 

30 bioluminomctric detection of ADP at high concentrations of ATP (see e.g., Schultz, et 
al, 1993. Anal. Biochem. 215: 302-304); continuous monitoring of DNA polymerase 
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activity (see e.g., Nyrbn, 1987. Anal. Biochem. 167: 235-238); and DNA sequencing (see 
e.g., Ronaghi, et al, 1996. Anal Biochenu 242: 84-89; Ronaghi, et al, 1998. Science 
281: 363-365; Ronaghi, et al, 1998. Anal Biochem. 267: 65-71). 

Several assays have been developed for detection of the forward ATP sulfurylase 
5 reaction. The colorimetric molybdolysis assay is based on phosphate detection (see e.g., 
Wilson and Bandurski, 1958. J. Biol Chem. 233: 975-981), whereas the continuous 
spectrophotometric molybdolysis assay is based upon the detection of NADH oxidation 
(see e.g., Seubert, et al, 1983. Arch. Biochem. Biophys. 225: 679-691; Seubert, et al, 
1985. Arch. Biochem. Biophys. 240: 509-523). The later assay requires the presence of 

10 several detection enzymes. In addition, several radioactive assays have also been 

described in the literature (see e.g., Daley, et al, 1986. Anal Biochem. 157: 385-395). 
For example, one assay is based upon the detection of 32 PPi released from 32 P-labeled 
ATP (see e.g., Seubert, et al, 1985. Arch. Biochem. Biophys. 240: 509-523) and another 
on the incorporation of 35 S into [ 35 S]-labeled APS (this assay also requires purified APS 

15 kinase as a coupling enzyme; see e.g., Seubert, et al, 1983. Arch. Biochem. Biophys. 225: 
679-691); and a third reaction depends upon the release of 35 SCV 2 from [ 35 S]-Iabeled 
APS (see e.g., Daley, et al, 1986. Anal Biochem. 157: 385-395). 

For detection of the reversed ATP sulfurylase reaction a continuous 
spectrophotometric assay (see e.g., Segel, etal, 1987 Methods Enzymol 143: 334-349); 

20 a bioluminometric assay (see e.g., Balharry and Nicholas, 1971 . Anal Biochem. 40: 1- 
17); an 35 S04~ 2 release assay (see e.g., Seubert, et al, 1985. Arch. Biochem. Biophys. 240: 
509-523); and a 32 PPi incorporation assay (see e.g., Osslund, et al, 1982. Plant Physiol 
70: 39-45) have been previously described 

ATP produced by an ATP sulfurylase can be hydrolyzcd using enzymatic 

25 reactions to generate light. Light-emitting chemical reactions (i.e., chemi luminescence) 
and biological reactions (i.e., bioluminescence) are widely used in analytical 
biochemistry for sensitive measurements of various metabolites. In bioluminescent 
reactions, the chemical reaction that leads to the emission of light is enzyme-catalyzed. 
For example, the luciferin-luciferasc system allows for specific assay of ATP and the 

30 bacterial luciferase-oxidoreductase system can be used for monitoring of NAD(P)H. 
Both systems have been extended to the analysis of numerous substances by means of 
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coupled reactions involving the production or utilization of ATP or NAD(P)H (see e.g., 
Kricka, 1991. Chemi luminescent and bioluminescent techniques. Clin. Chem. 37: 1472- 
1281). 

The development of new reagents have made it possible to obtain stable light 
5 emission proportional to the concentrations of ATP (see e.g., Lundin, 1982. Applications 
of firefly luciferasc In; Luminescent Assays (Raven Press, New York) or NAD(P)H (see 
e.g., Lovgren, et al. Continuous monitoring of NADH-converting reactions by bacterial 
luminescence. J. Appl. Biochem. 4: 103-1 1 1). With such stable light emission reagents, 
it is possible to make endpoint assays and to calibrate each individual assay by addition 

10 of a known amount of ATP or NAD(P)H. In addition, a stable light-emitting system also 
allows continuous monitoring of ATP- or NAD(P)H-converting systems. 

Suitable enzymes for converting ATP into light include luciferases, e.g., insect 
luciferases. Luciferases produce light as an end-product of catalysis. The best known 
light-emitting enzyme is that of the firefly, Photinus pyralis (Coleoptera). The 

15 corresponding gene has been cloned and expressed in bacteria (see e.g., de Wet, et al, 
1985. Proc. Natl. Acad. Sci. USA 80: 7870-7873) and plants (see e.g., Ow, et al., 1986. 
Science 234: 856-859), as well as in insect (see e.g., Jha, et al, 1 990. FEBS Lett. 274: 
24-26) and mammalian cells (see e.g., de Wet, et al., 1987. Mol. Cell. Biol. 7: 725-7373; 
Keller, et al, 1987. Proc. Natl. Acad. Sci. USA 82: 3264-3268). In addition, a number of 

20 luciferase genes from the Jamaican click beetle, Pyroplorus plagiophihalamus 

{Coleoptera), have recently been cloned and partially characterized (see e.g., Wood, et 
al, 1989.7. Biolumin. Chemilwnin. 4: 289-301; Wood, et al. 1989. Science 244: 700- 
702). Distinct luciferases can sometimes produce light of different wavelengths, which 
may enable simultaneous monitoring of light emissions at different wavelengths. 

25 Accordingly, these aforementioned characteristics are unique, and add new dimensions 
with respect to the utilization of current reporter systems. 

Firefly luciferase catalyzes bioluminesccnce in the presence of lucifcrin, 
adenosine 5' -triphosphate (ATP), magnesium ions, and oxygen, resulting in a quantum 
yield of 0.88 (see e.g., McElroy and Selinger, 1960. Arch. Biochem. Biophys. 88: 136- 

30 145). The firefly luciferase bioluminescent reaction can be utilized as an assay for the 
detection of ATP with a detection limit of approximately lxlO" 13 M (sec e.g., Leach, 
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1981. J. AppL Biochenu 3: 473-517). In addition, the overall degree of sensitivity and 
convenience of the luciferase-mcdiated detection systems have created considerable 
interest in the development of firefly luciferase-based biosensors (see e.g., Green and 
Kricka, 1984. TalantaZX: 173-176; Blum, etaL 1989. J. Biolumin. Chemilumin. 4: 
5 543-550). 

Using the above-described enzymes, the sequence primer is exposed to a 
polymerase and a known dNTP. If the dNTP is incorporated onto the 3' end of the 
primer sequence, the dNTP is cleaved and a PPi molecule is liberated. The PPi is then 
converted to ATP with ATP sulfurylase. Preferably, the ATP sulfiirylase is present at a 

10 sufficiently high concentration that the conversion of PPi proceeds with first-order 
kinetics with respect to PPi. In the presence of luciferase, the ATP is hydrolyzed to 
generate a photon. The reaction preferably has a sufficient concentration of luciferase 
present within the reaction mixture such that the reaction, ATP -» ADP + PO4 3 ' + photon 
(light), proceeds with first-order kinetics with respect to ATP. The photon can be 

15 measured using methods and apparatuses described below. In one embodiment, the PPi 
and a coupled sulfurylase/luciferase reaction is used to generate light for detection. In 
some embodiments, either or both the sulfurylase and luciferase are immobilized on one 
or more mobile solid supports disposed at each reaction site. 

The present invention thus permits PPi release to be detected during the 

20 polymerase reaction giving a real-time signal. The sequencing reactions may be 

continuously monitored in real-time. A procedure for rapid detection of PPi release is 
thus enabled by the present invention. The reactions have been estimated to take place in 
less than 2 seconds (Nyren and Lundin, supra). The rate limiting step is the conversion 
of PPi to ATP by ATP sulfurylase, while the luciferase reaction is fast and has been 

25 estimated to take less than 0.2 seconds. Incorporation rates for polymerases have also 
been estimated by various methods and it has been found, for example, that in the case of 
Klenow polymerase, complete incorporation of one base may take less than 0.5 seconds. 
Thus, the estimated total time for incorporation of one base and detection by this 
enzymatic assay is approximately 3 seconds. It will be seen therefore that very fast 

30 reaction times arc possible, enabling real-time detection. The reaction times could further 
be decreased by using a more thermostable luciferase. 
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For most applications it is desirable to use reagents free of contaminants like ATP 
and PPi. These contaminants may be removed by flowing the reagents through a pre- 
column containing apyrase andAor pyrophosphatase bound to resin. Alternatively, the 
apyrase or pyrophosphatase can be bound to magnetic beads and used to remove 
5 contaminating ATP and PPi present in the reagents. In addition it is desirable to wash 
away diffusible sequencing reagents, e.g., unincorporated dNTPs, with a wash buffer. 
Any wash buffer used in pyrophosphate sequencing can be used. 

In some embodiments, the concentration of reactants in the sequencing reaction 
include 1 pmol DNA, 3 pmol polymerase, 40 pmol dNTP in 0.2 ml buffer. See Ronaghi, 

10 et ai 9 Anal Biochem. 242: 84-89 (1996). 

The sequencing reaction can be performed with each of four predetermined 
nucleotides, if desired. A "complete" cycle generally includes sequentially administering 
sequencing reagents for each of the nucleotides dATP, dGTP, dCTP and dTTP (or 
dUTP), in a predetermined order. Unincorporated dNTPs are washed away between each 

15 of the nucleotide additions. Alternatively, unincorporated dNTPs are degraded by 
apyrase (see below). The cycle is repeated as desired until the desired amount of 
sequence of the sequence product is obtained. In some embodiments, about 10-1000, 10- 
100, 10-75, 20-50, or about 30 nucleotides of sequence information is obtained from 
extension of one annealed sequencing primer. 

20 In some embodiments, the nucleotide is modified to contain a disul fide-derivative 

of a hapten such as biotin. The addition of the modified nucleotide to the nascent primer 
annealed to the anchored substrate is analyzed by a post-polymerization step that includes 
i) sequentially binding of, in the example where the modification is a biotin, an avidin- or 
streptavidin-conjugated moiety linked to an enzyme molecule, ii) the washing away of 

25 excess avidin- or streptavidin-linked enzyme, iii) the flow of a suitable enzyme substrate 
under conditions amenable to enzyme activity, and iv) the detection of enzyme substrate 
reaction product or products. The hapten is removed in this embodiment through the 
addition of a reducing agent. Such methods enable a nucleotide to be identified in a 
given target position, and the DNA to be sequenced simply and rapidly while avoiding 

30 the need for electrophoresis and the use of potentially dangerous radiolabcls. 
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A preferred enzyme for detecting the hapten is horse-radish peroxidase. If 
desired, a wash buffer, can be used between the addition of various reactants herein. 
Apyrase can be used to remove unreacted dNTP used to extend the sequencing primer. 
The wash buffer can optionally include apyrase. 
5 Example haptens, e.g., biotin, digoxygenin, the fluorescent dye molecules cy3 and 

cy5, and fluorescein, are incorporated at various efficiencies into extended DNA 
molecules. The attachment of the hapten can occur through linkages via the sugar, the 
base, and via the phosphate moiety on the nucleotide. Example means for signal 
amplification include fluorescent, electrochemical and enzymatic. In a preferred 

10 embodiment using enzymatic amplification, the enzyme, e.g. alkaline phosphatase (AP), 
horse-radish peroxidase (HRP), beta-galactosidase, luciferase, can include those for 
which light-generating substrates are known, and the means for detection of these light- 
generating (chemiluminescent) substrates can include a CCD camera. 

In a preferred mode, the modified base is added, detection occurs, and the hapten- 

15 conjugated moiety is removed or inactivated by use of either a cleaving or inactivating 
agent. For example, if the clcavable-linker is a disulfide, then the cleaving agent can be a 
reducing agent, for example dithiothreitol (DTT), beta-mercaptoethanol, etc. Other 
embodiments of inactivation include heat, cold, chemical denaturants, surfactants, 
hydrophobic reagents, and suicide inhibitors. 

20 Luciferase can hydrolyze dATP directly with concomitant release of a photon. 

This results in a false positive signal because the hydrolysis occurs independent of 
incorporation of the dATP into the extended sequencing primer. To avoid this problem, a 
dATP analog can be used which is incorporated into DNA, i.e., it is a substrate for a 
DNA polymerase, but is not a substrate for luciferase. One such analog is a-thio-dATP. 

25 Thus, use of a-thio-dATP avoids the spurious photon generation that can occur when 
dATP is hydrolyzed without being incorporated into a growing nucleic acid chain. 

Typically, the PPi-based detection is calibrated by the measurement of the light 
released following the addition of control nucleotides to the sequencing reaction mixture 
immediately after the addition of the sequencing primer. This allows for normalization of 

30 the reaction conditions. Incorporation of two or more identical nucleotides in succession 
is revealed by a corresponding increase in the amount of light released. Thus, a two-fold 
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increase in released light relative to control nucleotides reveals the incorporation of two 
successive dNTPs into the extended primer. 

If desired, apyrase may be "washed" or "flowed" over the surface of the solid 
support so as to facilitate the degradation of any remaining, non-incorporated dNTPs 
5 within the sequencing reaction mixture. Apyrase also degrades the generated ATP and 
hence "turns off' the light generated from the reaction. Upon treatment with apyrase, any 
remaining reactants are washed away in preparation for the following dNTP incubation 
and photon detection steps. Alternatively, the apyrase may be bound to the solid or 
mobile solid support. 

10 

Double Ended Sequencing 

In a preferred embodiment we provide a method for sequencing from both ends of 
a nucleic acid template. Traditionally, the sequencing of two ends of a double stranded 
DNA molecule would require at the very least the hybridization of primer, sequencing of 

1 5 one end, hybridization of a second primer, and sequencing of the other end. The 

alternative method is to separate the individual strands of the double stranded nucleic 
acid and individually sequence each strand. The present invention provides a third 
alternative that is more rapid and less labor intensive than the first two methods. 

The present invention provides for a method of sequential sequencing of nucleic 

20 acids from multiple primers. References to DNA sequencing in this application are 
directed to sequencing using a polymerase wherein the sequence is determined as the 
nucleotide triphosphate (NTP) is incorporated into the growing chain of a sequencing 
primer. One example of this type of sequencing is the pyro-sequencing detection 
pyrophosphate method (see, e.g., U.S. Patents 6,274,320, 6258,568 and 6,210,891, each 

25 of which is incorporated in total herein by reference.). 

In one embodiment, the present invention provides for a method for sequencing 
two ends of a template double stranded nucleic acid. The double stranded DNA is 
comprised of two single stranded DNA; referred to herein as a first single stranded DNA 
and a second single stranded DNA. A first primer is hybridized to the first single 

30 stranded DNA and a second primer is hybridized to the second single stranded DNA. 
The first primer is unprotected while the second primer is protected. "Protection" and 
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"protected*' are defined in this disclosure as being the addition of a chemical group to 
reactive sites on the primer that prevents a primer from polymerization by DNA 
polymerase. Further, the addition of such chemical protecting groups should be 
reversible so that after reversion, the now deprotected primer is once again able to serve 
5 as a sequencing primer. The nucleic acid sequence is determined in one direction (e.g., 
from one end of the template) by elongating the first primer with DNA polymerase using 
conventional methods such as pyrophosphate sequencing. The second primer is then 
deprotected, and the sequence is determined by elongating the second primer in the other 
direction (e.g., from the other end of the template) using DNA polymerase and 

10 conventional methods such as pyrophosphate sequencing. The sequences of the first and 
second primers are specifically designed to hybridize to the two ends of the double 
stranded DNA or at any location along the template in this method. 

In another embodiment, the present invention provides for a method of 
sequencing a nucleic acid from multiple primers. In this method a number of sequencing 

15 primers are hybridized to the template nucleic acid to be sequenced. All the sequencing 
primers are reversibly protected except for one. A protected primer is an oligonucleotide 
primer that cannot be extended with polymerase and dNTPs which are commonly used in 
DNA sequencing reactions. A reversibly protected primer is a protected primer which 
can be deprotected. All protected primers referred to in this invention are reversibly 

20 protected. After deprotection, a reversibly protected primer functions as a normal 
sequencing primer and is capable of participating in a normal sequencing reaction. 

The present invention provides for a method of sequential sequencing a nucleic 
acid from multiple primers. The method comprises the following steps: First, one or 
more template nucleic acids to be sequenced are provided. Second, a plurality of 

25 sequencing primers are hybridized to the template nucleic acid or acids. The number of 
sequencing primers may be represented by the number n where n can be any positive 
number greater than 1 . That number may be, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 
greater. Of the primers, n-1 number may be protected by a protection group. So, for 
example, if n is 2, 3, 4, 5, 6, 7, 8, 9 or 10, n-1 would be 1, 2, 3, 4, 5, 6, 7, 8, 9 

30 respectively. The remaining primer (e.g., n number primers - (n-1) number of protected 
primers = one remaining primer) is unprotected. Third, the unprotected primer is 
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extended and the template DNA sequence is determined by conventional methods such 
as, for example, pyrophosphate sequencing. Fourth, after the sequencing of the first 
primer, one of the remaining protected primers is unprotected. Fifth, unprotected primer 
is extended and the template DNA sequence is determined by conventional methods such 
5 as, for example, pyrophosphate sequencing. Optionally, the method may be repeated 
until sequencing is performed on all the protected primers. 

In another aspect, the present invention includes a method of sequential 
sequencing of a nucleic acid comprising the steps of: (a) hybridizing 2 or more 
sequencing primers to the nucleic acid wherein all the primers except for one are 

10 reversibly protected; (b) determining a sequence of one strand of the nucleic acid by 
polymerase elongation from the unprotected primer; (c) deprotecting one of the 
reversibly protected primers into an unprotected primer; (d) repeating steps (b) and (c) 
until all the reversibly protected primers are deprotected and used for determining a 
sequence. In one embodiment, this method comprises one additional step between steps 

15 (b) and (c), i.e., the step of terminating the elongation of the unprotected primer by 

contacting the unprotected primer with DNA polymerase and one or more of a nucleotide 
triphosphate or a dideoxy nucleotide triphosphate. In yet another embodiment, this 
method further comprises an additional step between said step (b) and (c), i.e., 
terminating the elongation of the unprotected primer by contacting the unprotected 

20 primer with DNA polymerase and a dideoxy nucleotide triphosphate from ddATP, 
ddTTP, ddCTP, ddGTP or a combination thereof. 

In another aspect, this invention includes a method of sequencing a nucleic acid 
comprising: (a) hybridizing a first unprotected primer to a first strand of the nucleic acid; 
(b) hybridizing a second protected primer to a second strand; (c) exposing the first and 

25 second strands to polymerase, such that the first unprotected primer is extended along the 
first strand; (d) completing the extension of the first sequencing primer; (e) deprotecting 
the second sequencing primer; and (f) exposing the first and second strands to 
polymerase so that the second sequencing primer is extended along the second strand. In 
a preferred embodiment, completing comprises capping or terminating the elongation. 

30 hi another embodiment, the present invention provides for a method for 

sequencing two ends of a template double stranded nucleic acid that comprises a first and 
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a second single stranded DNA. In this embodiment, a first primer is hybridized to the 
first single stranded DNA and a second primer is hybridized to the second single stranded 
DNA in the same step. The first primer is unprotected while the second primer is 
protected. 

5 Following hybridization, the nucleic acid sequence is determined in one direction 

(eg., from one end of the template) by elongating the first primer with DNA polymerase 
using conventional methods such as pyrophosphate sequencing. In a preferred 
embodiment, the polymerase is devoid of 3' to 5' exonuclease activity. The second 
primer is then deprotected, and its sequence is determined by elongating the second 

10 primer in the other direction (e.g., from the other end of the template) with DNA 
polymerase using conventional methods such as pyrophosphate sequencing. As 
described earlier, the sequences of the first primer and the second primer are designed to 
hybridize to the two ends of the double stranded DNA or at any location along the 
template. This technique is especially useful for the sequencing of many template DNAs 

15 that contain unique sequencing primer hybridization sites on its two ends. For example, 
many cloning vectors provide unique sequencing primer hybridization sites flanking the 
insert site to facilitate subsequent sequencing of any cloned sequence (eg., Bluescript, 
Stratagene, La Jolla, CA). 

One benefit of this method of the present invention is that both primers may be 

20 hybridized in a single step. The benefits of this and other methods are especially useful 
in parallel sequencing systems where hybridizations are more involved than normal. 
Examples of parallel sequencing systems are disclosed in copending U.S. patent 
application serial number 10/104,280, the disclosure of which is incorporated in total 
herein. 

25 The oligonucleotide primers of the present invention may be synthesized by 

conventional technology, e.g., with a commercial oligonucleotide synthesizer and/or by 

ligating together subfragments that have been so synthesized. 

In another embodiment of the invention, the length of the double stranded target 

nucleic acid may be determined. Methods of determining the length of a double stranded 
30 nucleic acid are known in the art. The length determination may be performed before or 

after the nucleic acid is sequenced. Known methods of nucleic acid molecule length 
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determination include gel electrophoresis, pulsed field gel electrophoresis, mass 
spectroscopy and the like. Since a blunt ended double stranded nucleic acid is comprised 
of two single strands of identical lengths, the determination of the length of one strand of 
a nucleic acid is sufficient to determine the length of the corresponding double strand. 
5 The sequence reaction according to the present invention also allows a 

determination of the template nucleic acid length. First, a complete sequence from one 
end of the nucleic acid to another end will allow the length to be determined. Second, the 
sequence determination of the two ends may overlap in the middle allowing the two 
sequences to be linked. The complete sequence may be determined and the length may 

10 be revealed. For example, if the template is 100 bps long, sequencing from one end may 
determine bases 1 to 75; sequencing from the other end may determine bases 25 to 100; 
there is thus a 51 base overlap in the middle from base 25 to base 75; and from this 
information, the complete sequence from 1 to 100 may be determined and the length, of 
100 bases, may be revealed by the complete sequence. 

15 Another method of the present invention is directed to a method comprising the 

following steps. First a plurality of sequencing primers, each with a different sequence, 
is hybridized to a DNA to be sequenced. The number of sequencing primers may be any 
value greater than one such as, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more. All of these 
primers are reversibly protected except for one. The one unprotected primer is elongated 

20 in a sequencing reaction and a sequence is determined. Usually, when a primer is 

completely elongated, it cannot extend and will not affect subsequent sequencing from 
another primer. If desired, the sequenced primer may be terminated using excess 
polymerase and dNTP or using ddNTPs. If a termination step is taken, the termination 
reagents (dNTPs and ddNTPs) should be removed after the step. Then, one of the 

25 reversibly protected primers is unprotected and sequencing from the second primer 
proceeds. The steps of deprotecting a primer, sequencing from the deprotected primer, 
and optionally, terminating sequencing from the primer is repeated until all the protected 
primers are unprotected and used in sequencing. 

The reversibly protected primers should be protected with different chemical 

30 groups. By choosing the appropriate method of deprotection, one primer may be 

deprotected without affecting the protection groups of the other primers. In a preferred 
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embodiment, the protection group is PO4. That is, the second primer is protected by P0 4 
and deprotection is accomplished by T4 polynucleotide kinase (utilizing its 3'- 
phosphatase activity). In another preferred embodiment, the protection is a thio group or 
a phosphorothiol group. 
5 The template nucleic acid may be a DNA, RN A, or peptide nucleic acid (PNA). 

While DNA is the preferred template, RNA and PNA may be converted to DNA by 
known techniques such as random primed PCR, reverse transcription, RT-PCR or a 
combination of these techniques. Further, the methods of the invention are usefiil for 
sequencing nucleic acids of unknown and known sequence. The sequencing of nucleic 

10 acid of known sequence would be useful, for example, for confirming the sequence of 
synthesized DNA or for confirming the identity of suspected pathogen with a known 
nucleic acid sequence. The nucleic acids may be a mixture of more than one population 
of nucleic acids. It is known that a sequencing primer with sufficient specificity (e.g., 20 
bases, 25 bases, 30 bases, 35 bases, 40 bases, 45 bases, or 50 bases) may be used to 

1 5 sequence a subset of sequences in a long nucleic acid or in a population of unrelated 
nucleic acids. Thus, for example, the template may be one sequence of 10 Kb or ten 
sequences of 1 Kb each. In a preferred embodiment, the template DNA is between 50 bp 
to 700 bp in length. The DNA can be single stranded or double stranded. 

In the case where the template nucleic acid is single stranded, a number of 

20 primers may be hybridized to the template nucleic acid as shown below: 

5--primer4--3' 5 ? -primer3 -3* 5'-primer 2 -3' 5' -primer 1-3' 

3' template nucleic acid 5' 

In this case, it is preferred that the initial unprotected primer would be the primer 
that hybridizes at the most 5' end of the template. See primer 1 in the above illustration. 

25 In this orientation, the elongation of primer 1 would not displace (by strand displacement) 
primer 2, 3, or 4. When sequencing from primer 1 is finished, primer 2 can be 
unprotected and nucleic acid sequencing can commence. The sequencing from primer 2 
may displace primer 1 or the elongated version of primer one but would have no effect on 
the remaining protected primers (primers 3 and 4). Using this order, each primer may be 

30 used sequentially and a sequencing reaction from one primer would not affect the 
sequencing from a subsequent primer. 

64 



E/M: EV262659147US 



21465-514 



One feature of the invention is the ability to use multiple sequencing primers on 
one or more nucleic acids and the ability to sequence from multiple primers using only 
one hybridization step. In the hybridization step, all the sequencing primers (e.g., the n 
number of sequencing primers) may be hybridized to the template nucleic acid(s) at the 
5 same time. In conventional sequencing, usually one hybridization step is required for 
sequencing from one primer. One feature of the invention is that the sequencing from n 
primers (as defined above) may be performed by a single hybridization step. This 
effectively eliminates n-1 hybridization step. 

In a preferred embodiment, the sequences of the n number of primers are 

10 sufficiently different that the primers do not cross hybridize or self-hybridize. Cross 
hybridization refers to the hybridization of one primer to another primer because of 
sequence complementarity. One form of cross hybridization is commonly referred to as a 
"primer dimer." In the case of a primer dimer, the 3' ends of two primers arc 
complementary and form a structure that when elongated, is approximately the sum of the 

1 5 length of the two primers. Self-hybridization refers to the situation where the 5' end of a 
primer is complementary to the 3* end of the primer. In that case, the primer has a 
tendency to self hybridize to form a hairpin-like structure. 

A primer can interact or become associated specifically with the template 
molecule. By the terms "interact" or "associate", it is meant herein that two substances or 

20 compounds (e.g., primer and template; chemical moiety and nucleotide) are bound (e.g., 
attached, bound, hybridized, joined, annealed, covalently linked, or otherwise associated) 
to one another sufficiently that the intended assay can be conducted. By the terms 
"specific" or "specifically", it is meant herein that two components bind selectively to 
each other. The parameters required to achieve specific interactions can be determined 

25 routinely, e.g., using conventional methods in the art. 

To gain more sensitivity or to help in the analysis of complex mixtures, the 
protected primers can be modified (e.g., derivatized) with chemical moieties designed to 
give clear unique signals. For example, each protected primer can be derivatized with a 
different natural or synthetic amino acid attached through an amide bond to the 

30 oligonucleotide strand at one or more positions along the hybridizing portion of the 
strand. The chemical modification can be detected, of course, either after having been 
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cleaved from the target nucleic acid, or while in association with the target nucleic acid. 
By allowing each protected target nucleic acid to be identified in a distinguishable 
manner, it is possible to assay (e.g., to screen) for a large number of different target 
nucleic acids in a single assay. Many such assays can be performed rapidly and easily. 
5 Such an assay or set of assays can be conducted, therefore, with high throughput 
efficiency as defined herein. 

In the methods of the invention, after a first primer is elongated and the sequence 
of the template DNA is determined, a second primer is deprotected and sequenced. There 
is no interference between the sequencing reaction of the first primer with the sequencing 

1 0 reaction of the second, now unprotected, primer because the first primer is completely 
elongated or terminated. Because the first primer is completely elongated, the 
sequencing from the second primer, using conventional methods such a pyrophosphate 
sequencing, will not be affected by the presence of the elongated first primer. The 
invention also provides a method of reducing any possible signal contamination from the 

15 first primer. Signal contamination refers to the incidences where the first primer is not 
completely elongated. In that case, the first primer will continue to elongate when a 
subsequent primer is deprotected and elongated. The elongation of both the first and 
second primers may interfere with the determination of DNA sequence. 

In a preferred embodiment, the sequencing reaction {e.g., the chain elongation 

20 reaction) from one primer is first terminated or completed before a sequencing reaction is 
started on a second primer. A chain elongation reaction of DNA can be terminated by 
contacting the template DNA with DNA polymerase and dideoxy nucleotide 
triphosphates (ddNTPs) such as ddATP, ddTTP, ddGTP and ddCTP. Following 
termination, the dideoxy nucleotide triphosphates may be removed by washing the 

25 reaction with a solution without ddNTPs. A second method of preventing further 

elongation of a primer is to add nucleotide triphosphates (dNTPs such as dATP, dTTP, 
dGTP and dCTP) and DNA polymerase to a reaction to completely extend any primer 
that is not completely extended. Following complete extension, the dNTPs and the 
polymerases are removed before the next primer is deprotected. By completing or 

30 terminating one primer before deprotecting another primer, the signal to noise ratio of the 
sequencing reaction (e.g., pyrophosphate sequencing) can be improved. 
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The steps of (a) optionally terminating or completing the sequencing, (b) 
deprotecting a new primer, and (c) sequencing from the deprotected primer may be 
repeated until a sequence is determined from the elongation of each primer. In this 
method, the hybridization step comprises "n" number of primers and one unprotected 
5 primer. The unprotected primer is sequenced first and the steps of (a), (b) and (c) above 
may be repeated. 

In a preferred embodiment, pyrophosphate sequencing is used for all sequencing 
conducted in accordance with the method of the present invention. 

In another preferred embodiment, the double ended sequencing is performed 

10 according to the process outlined in Figure 1 0. This process may be divided into six 
steps: (1) creation of a capture bead (Figure 10A); (2) drive to bead (DTB) PCR 
amplification (Figure 10B); (3) SL reporter system preparation (Figure 10C); (4) 
sequencing of the first strand (Figure 10D); (5) preparation of the second strand (Figures 
10E and 10F); and (6) analysis of each strand (Figure 10G). This exemplary process is 

15 outlined below. 

In step 1 , an N-hydroxysuccinimide (NHS)-activated capture bead (e.g., 
Amersham Biosciences, Piscataway, NJ) is coupled to both a forward primer and a 
reverse primer. NHS coupling forms a chemically stable amide bond with ligands 
containing primary amino groups. The capture bead is also coupled to biotin (Figure 

20 10A). The beads (i.e., solid nucleic acid capturing supports) used herein may be of any 
convenient size and fabricated from any number of known materials. Example of such 
materials include: inorganics, natural polymers, and synthetic polymers. Specific 
examples of these materials include: cellulose, cellulose derivatives, acrylic resins, glass; 
silica gels, polystyrene, gelatin, polyvinyl pyrrolidone, co-polymers of vinyl and 

25 acrylamide, polystyrene cross-linked with divinylbenzene or the like (see, Merrifield 
Biochemistry 1964, 3, 1385-1390), polyacrylamidcs, latex gels, polystyrene, dextran, 
rubber, silicon, plastics, nitrocellulose, celluloses, natural sponges, silica gels, glass, 
metals plastic, cellulose, cross-linked dexlrans (e.g., Sephadex IM ) and agarose gel 
(Sepharose 1 ™) and solid phase supports known to those of skill in the art. In a preferred 

30 embodiment, the capture beads are Sepharose beads approximately 25 to 40 jiM in 
diameter. 
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In step 2, template DNA which has hybridized to the forward and reverse primers 
is added, and the DNA is amplified through a PCR amplification strategy (Figure 10B). 
In one embodiment, the DNA is amplified by Emulsion Polymerase Chain Reaction, 
Drive to Bead Polymerase Chain Reaction, Rolling Circle Amplification or Loop- 
5 mediated Isothermal Amplification. In step 3, streptavidin is added followed by the 
addition of sulfurylase and luciferase which are coupled to the streptavidin (Figure IOC). 
The addition of auxiliary enzymes during a sequencing method has been disclosed in 
U.S.S.N. 10/104,280 and U.S. S.N. 10/127,906, which are incorporated herein in their 
entireties by reference. In one embodiment, the template DNA has a DNA adaptor 

10 ligated to both the 5' and 3' end. In a preferred embodiment, the DNA is coupled to the 
DNA capture bead by hybridization of one of the DNA adaptors to a complimentary 
sequence on the DNA capture bead. 

In the first step, single stranded nucleic acid template to be amplified is attached 
to a capture bead. The nucleic acid template may be attached to the capture bead in any 

15 manner known in the art. Numerous methods exist in the art for attaching the DNA to a 
microscopic bead. Covalent chemical attachment of the DNA to the bead can be 
accomplished by using standard coupling agents, such as water-soluble carbodiimide, to 
link the 5-phosphate on the DNA to amine-coated microspheres through a 
phosphoamidate bond. Another alternative is to first couple specific oligonucleotide 

20 linkers to the bead using similar chemistry, and to then use DNA ligase to link the DNA 
to the linker on the bead. Other linkage chemistries include the use of N- 
hydroxysuccinamide (NHS) and its derivatives, to join the oligonucleotide to the beads. 
In such a method, one end of the oligonucleotide may contain a reactive group (such as 
an amide group) which forms a covalent bond with the solid support, while the other end 

25 of the linker contains another reactive group which can bond with the oligonucleotide to 
be immobilized. In a preferred embodiment, the oligonucleotide is bound to the DNA 
capture bead by covalent linkage. However, non-covalent linkages, such as chelation or 
antigen-antibody complexes, may be used to join the oligonucleotide to the bead. 

Oligonucleotide linkers can be employed which specifically hybridize to unique 

30 sequences at the end of the DNA fragment, such as the overlapping end from a restriction 
enzyme site or the "sticky ends" of bacteriophage lambda based cloning vectors, but 
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blunt-end ligations can also be used beneficially. These methods are described in detail 
in US 5,674,743, the disclosure of which is incorporated in (oto herein. It is preferred 
that any method used to immobilize the beads will continue to bind the immobilized 
oligonucleotide throughout the steps in the methods of the invention. In a preferred 
5 embodiment, the oligonucleotide is bound to the DNA capture bead by covalent linkage. 
However, non-covalent linkages, such as chelation or antigen-antibody complexes, may 
be used to join the oligonucleotide to the bead. 

In step 4, the first strand of DNA is sequenced by depositing the capture beads 
onto a PicoTiter plate (PTP), and sequencing by a method known to one of ordinary skill 

10 in the art (eg., pyrophosphate sequencing) (Figure 10D). Following sequencing, a 
mixture of dNTPs and ddNTPs are added in order to "cap" or terminate the sequencing 
process (Figure 10E). In step 5, the second strand of nucleic acid is prepared by adding 
apyrase to remove the ddNTPs and polynucleotide kinase (PNK) to remove the 3' 
phosphate group from the blocked primer strand (Figure 10F). Polymerase is then added 

15 to prime the second strand followed by sequencing of the second strand according to a 
standard method known to one of ordinary skill in the art (Figure 10G). In step 7, the 
sequence of the both the first and second strand is analyzed such that a contiguous DNA 
sequence is determined. 

20 Detection means 

The solid support is optically linked to an imaging system 230, which includes a 
CCD system in association with conventional optics or a fiber optic bundle. In one 
embodiment the perfusion chamber substrate includes a fiber optic array wafer such that 
light generated near the aqueous interface is transmitted directly through the optical fibers 

25 to the exterior of the substrate or chamber. When the CCD system includes a fiber optic 
connector, imaging can be accomplished by placing the perfusion chamber substrate in 
direct contact with the connector. Alternatively, conventional optics can be used to 
image the light, e.g., by using a 1-1 magnification high numerical aperture lens system, 
from the exterior of the fiber optic substrate directly onto the CCD sensor. When the 

30 substrate does not provide for fiber optic coupling, a lens system can also be used as 



69 



E/M: EV262659147US 



21465-514 



described above, in which case either the substrate or the perfusion chamber cover is 
optically transparent. An exemplary CCD imaging system is described above. 

The imaging system 230 is used to collect light from the reactors on the substrate 
surface. Light can be imaged, for example, onto a CCD using a high sensitivity low 
5 noise apparatus known in the art. For fiber-optic based imaging, it is preferable to 

incorporate the optical fibers directly into the cover slip or for a FORA to have the optical 
fibers that form the microwells also be the optical fibers that convey light to the detector. 

The imaging system is linked to a computer control and data collection system 
240. In general, any commonly available hardware and software package can be used. 

10 The computer control and data collection system is also linked to the conduit 200 to 
control reagent delivery. 

The photons generated by the pyrophosphate sequencing reaction are captured by 
the CCD only if they pass through a focusing device (e.g., an optical lens or optical fiber) 
and are focused upon a CCD element. However, the emitted photons will escape equally 

15 in all directions. In order to maximize their subsequent "capture" and quantitation when 
utilizing a planar array (e.g., a DNA chip), it is preferable to collect the photons as close 
as possible to the point at which they are generated, e.g. immediately at the planar solid 
support. This is accomplished by either: (/) utilizing optical immersion oil between the 
cover slip and a traditional optical lens or optical fiber bundle or, preferably, 

20 (ii) incorporating optical fibers directly into the cover slip itself. Similarly, when a thin, 
optically transparent planar surface is used, the optical fiber bundle can also be placed 
against its back surface, eliminating the need to "image" through the depth of the entire 
reaction/perfusion chamber. 

The reaction event, e.g., photons generated by luciferase, may be detected and 

25 quantified using a variety of detection apparatuses, e.g., a photomultiplier tube, a CCD, 
CMOS, absorbance photometer, a luminometer, charge injection device (C1D), or other 
solid state detector, as well as the apparatuses described herein. In a preferred 
embodiment, the quantitation of the emitted photons is accomplished by the use of a 
CCD camera fitted with a fused fiber optic bundle. In another preferred embodiment, the 

30 quantitation of the emitted photons is accomplished by the use of a CCD camera fitted 
with a microchannel plate intensifier. A back-thinned CCD can be used to increase 
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sensitivity. CCD detectors are described in, e.g., Bronks, et ai, 1995. Anal Chem. 65: 
2750-2757. 

An exemplary CCD system is a Spectral Instruments, Inc. (Tucson, AZ) Series 
600 4-port camera with a Lockheed-Martin LM485 CCD chip and a 1-1 fiber optic 
5 connector (bundle) with 6-8 jim individual fiber diameters. This system has 4096 x 

4096, or greater than 16 million pixels and has a quantum efficiency ranging from 10% to 
> 40%. Thus, depending on wavelength, as much as 40% of the photons imaged onto the 
CCD sensor are converted to detectable electrons. 

In other embodiments, a fluorescent moiety can be used as a label and the 

10 detection of a reaction event can be carried out using a confocal scanning microscope to 
scan the surface of an array with a laser or other techniques such as scanning near-field 
optical microscopy (SNOM) are available which are capable of smaller optical resolution, 
thereby allowing the use of "more dense" arrays. For example, using SNOM, individual 
polynucleotides may be distinguished when separated by a distance of less than 100 nm, 

15 e.g., lOnm x lOnm. Additionally, scanning tunneling microscopy (Binning et ai., 

Helvetica Physica Acta, 55:726-735, 1982) and atomic force microscopy (Hanswa et al., 
Ann u Rev Biophys Biomol Struct, 23 : 1 1 5- 1 39, 1 994) can be used. 

Haplotypc Application 

20 Virtually any sequencing application can be accomplished using the methods and 

apparatus of this invention. In one embodiment we contemplate haplotype mapping. 
Human gene diversity is an important factor in the variability of patient response to 
pharmaceuticals. The most precise measurement of this diversity is the haplotype, which 
is the organization of polymorphic variation as it is found on a chromosome. Recently, 

25 major government and academic genome researchers in the US, Canada and Europe have 
agreed that haplotypes are a powerful tool that can reduce the complexity of genetic 
information to a practical form. Haplotypes can be used in drug discovery to improve the 
outcome of target validation and drug screening studies and in drug development to 
improve the design and reliability of clinical trials. Haplotype markers can be used to 

30 predict the efficacy and safety of new and approved drugs and will serve as the 
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foundation for a new paradigm of personalized medicine matching patients to the right 
drug at the right dose via guidance from a database of clinical marker-associations. 

Numerous empirical studies have shown that nearby SNP alleles are often in linkage 
disequilibrium (LD) with each other, such that the state of one SNP allele is often highly 
5 correlated with the allele of another close SNP. These correlations exist because of the 
shared history of tightly linked SNP's, which are co-transmitted from generation to 
generation. Patterns of human sequence variation (haplotypes) thus represent ancestral 
DNA segments. Historical meioses have slowly dissociated alleles from neighboring 
alleles on ancestral chromosomes, except for tightly linked variants. The extent of linkage 

10 disequilibrium in founder populations with recent bottlenecks had been the object of 
numerous studies - particularly in the cloning of simple Mendelian disorders disorders 
such as cystic fibrosis (16), Huntington's disease (11), diastrophic dysplasia (DTD) (8). 
Whereas these cloning studies benefited from the large chromosomal segments showing 
LD spanning over large distances (often in the megabase range), very little empirical data 

1 5 was available until recently regarding LD across the human genome in the world 
population. 

We focus on three recent examples of large-scale surveys of LD (and haplotypes): 
(see, e.g, Reich, D. E., Cargill, M., Bolk, S., Ireland, J., Sabeti, P. C, Richter, D. J., 
Lavery, T., Kouyoumjian, R., Farhadian, S. F., Ward, R. & Lander, E. S. 2001 . Linkage 

20 disequilibrium in the human genome. Nature 41 1, 199-204.26). We sampled 19 

chromosome regions for their SNP content. High frequency SNP's spanning intervals of 2 
to 160 kb were first genotyped in a Caucasian samples. Over all regions, LD was 
detectable at distances of about 60kb, with a significant difference between regions, as 
the range was as short as 6kb at one locus and as long 155 kb in another. Not 

25 surprisingly, LD was significantly correlated with the estimated local recombination 
rates. Further analysis in a Nigerian sample provided evidence of shorter LD in this 
population - although the allelic combinations over short distances were similar to the 
Caucasian sample. Overall - this work provided evidence that large blocks of LD are 
common across the human genome, and that genome-wide LD mapping of disease genes 

30 will be feasible. 
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Kits 

The invention also comprises kits for use in methods of the invention which could 
include one or more of the following components: (a) a test specific primer which 

5 hybridizes to sample DNA so that the target position is directly adjacent to the 3' end of 
the primer; (b) a polymerase; (c) detection enzyme means for identifying PPi release; 
(d) deoxynucleotides including, in place of dATP, a dATP analogue which is capable of 
acting as a substrate for a polymerase but incapable of acting as a substrate for a said PPi- 
detection enzyme; and (e) optionally dideoxynucleotides, optionally ddATP being 

1 0 replaced by a ddATP analogue which is capable of acting as a substrate for a polymerase 
but incapable of acting as a substrate for a said PPi-detection enzyme. If the kit is for use 
with initial PCR amplification then it could also include the following components: (i) a 
pair of primers for PCR, at least one primer having means permitting immobilization of 
said primer, (ii) a polymerase which is preferably heat stable, for example Taql 

1 5 polymerase; (iii) buffers for the PCR reaction; and (iv) deoxynucleotides. Where an 
enzyme label is used to evaluate PCR, the kit will advantageously contain a substrate for 
the enzyme and other components of a detection system. 
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EXAMPLES 



Example 1: Method of Sample Preparation 



5 Step 1: DNase I Digestion 

The purpose of the DNase I digestion step is to fragment a large stretch of DNA 
such as a whole genome or a large portion of a genome into smaller species. This 
population of smaller-sized DNA species generated from a single DNA template is 
referred to as a "library". Deoxyribonuclease I (DNase I) is an endonuclease which 

10 cleaves double-stranded template DNA. The cleavage characteristics of DNase I allow 
random digestion of template DNA (i.e., no sequence bias) and will result in the 
predominance of blunt-ended, double-stranded DNA fragments when used in the 
presence of manganese-based buffers (Melgar and Goldthwait 1968). The digestion of 
genomic templates by DNase 1 is dependent on three factors: i) quantity of enzyme used 

15 (units); ii) temperature of digestion (°C); and iii) incubation time (minutes). The DNase I 
digestion conditions outlined below have been optimized to yield DNA libraries in a size 
range from 50-700 base pairs (bp). 



1 . DNA was obtained and prepared to: 0.3 mg/ml in Tris-HCl (lOmM, pH 7-8). A 
20 total of 1 34 microliters of DNA was needed for this preparation. 

It is recommended to not use DNA preparations diluted with buffers containing 
EDTA (i.e., TE, Tris/EDTA). The presence of EDTA is inhibitory to enzyme 
digestion with DNase I. If the DNA preparation contains EDTA, it is essential that 
25 the DNA be "salted" out of solution and reconstituted with the appropriate Tris-HCl 

buffer (10 mM, pH 7-8) or nanopure H 2 0 (pH 7-8). 



30 



35 



2. In a 0.2 ml tube, DNase I Buffer, comprising 50 nl Tris pH 7.5 (1M), 10 pi MnCl 2 
(1M), 1 \x\ BSA (100 mg/ml), and 39 \x\ water was prepared. 

3. In a separate 0.2 ml tube, 15 ^1 of DNase I Buffer, and 1 .5 ^il of DNase I (lU/ml) 
was added. 

The reaction tube was placed in a thermal cycler set to 1 5°C. 

4. 134 jal of DNA (0.3 mg/ml) from step 1 was added to the DNase I reaction tube 
placed in the thermal cycler set at 1 5°C. The lid was closed and the reaction was 
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incubated for exactly 1 minute. Following incubation, 50 |al of 50 mM EDTA was 
added to stop the enzyme digestion. 

5. The digested DNA was purified by using the QiaQuick PCR purification kit. The 
5 digestion reaction was then split into four, and four spin columns were used to 

purify each fraction (37.5 \i\ per spin column). Each column was eluted with 30 |il 
elution buffer (EB) according to the procedure per manufacturer's protocol. The 
eluates were then combined to generate a final reaction volume of 120 

10 6. 3~jal aliquot of the digestion reaction was saved for analysis using a BioAnalzyer 
DNA 1000 LabChip. 

Step 2: Pfu Polishing 

Digestion of DNA templates with DNase I will yield fragments of DNA that are 

15 primarily blunt-ended, however, some fragments will have ends that contain protruding 

termini that are one or two nucleotides in length. Pfu polishing is used to increase the 

amount of blunt-ended species by fill-in (i.e., "blunting") of 5' overhangs. Pfu DNA 

polymerase has 3'-> 5' exonuclease activity that will result in the removal of single and 

double nucleotide extensions and does not possess DNA extendase activity. Pfu 

20 polishing will increase the amount of blunt-ended DNA fragments available for adaptor 

ligation (Costa 1994a, 1994b, 1994c). 

1 . In a 0.2 ml tube, 1 1 5 |il purified, DNase I-digested DNA fragments, 1 5 \i\ 1 OX 
Cloned Pfu buffer, 5 \i\ dNTPs (10 mM), and 15 \i\ cloned Pfu DNA polymerase 
(2.5 U/jil) were added in order. 



25 



The polishing reaction was then mixed well and incubated at 72* C for 30 
minutes. 



3. Following incubation the reaction tube was removed and placed on ice for 2 
30 minutes. 

4. The polishing reaction was then split into four QiaQuick PCR purification columns 
(37.5 ^iL each). Each column was eluted with 30 jil buffer EB according to the 
procedure per manufacturer's protocol. The eluates were then combined to 

35 generate a final reaction volume of 120 ^L. 

5. A 3-|al aliquot of the final polishing reaction was saved for analysis using a 
BioAnalzyer DNA 1 000 LabChip. 

40 Step 3: Ligation of Universal Adaptors to Fragmented DNA Library 
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Following fragmentation and polishing of the genomic DNA library, primer 
sequences are added to the ends of each DNA fragment. These primer sequences are 
termed "Universal Adaptors" and are comprised of double-stranded oligonucleotides that 
contain specific priming regions that afford both PCR amplification and nucleotide 
5 sequencing. The Universal Adaptors are designed to include a set of unique PCR 
priming regions that are 20 base pairs in length located adjacent to a set of unique 
sequencing priming regions that are 20 base pairs in length, followed by a unique 4-base 
"key" consisting of one of each deoxyribonucleotide (i.e., A, C, G, T). Each unique 
Universal Adaptor (termed "Universal Adaptor A" and "Universal Adaptor B") is forty- 
10 four base pairs (44-bp) in length. Universal Adaptors are ligated, using T4 DNA ligase, 
onto each end of the DNA fragment to generate a total nucleotide addition of 88-bp to 
each DNA fragment. Different Universal Adaptors are designed specifically for each 
genomic DNA library preparation and will therefore provide a unique identifier for each 
organism. 

1 5 To prepare a pair of Universal Adaptors, single-stranded oligonucleotides are 

designed in-house and are manufactured through a commercial vendor. Universal 
Adaptor DNA oligonucleotides are designed with two phosphorothioate linkages at each 
oligonucleotide end that serve to protect against nuclease activity (Samini, T.D., B. 
Jolles, and A. Laigle. 2001. Best minimally modified antisense oligonucleotides 

20 according to cell nuclease activity. Antisense Nucleic Acid Drug Dev. 1 1 (3): 1 29., the 
disclosure of which is incorporated in toto herein by reference.). Each oligonucleotide is 
HPLC-purified to ensure there are no contaminating or spurious DNA oligonucleotide 
sequences in the final prep. 

The Universal Adaptors are designed to allow directional ligation to the blunt- 

25 ended, fragmented genomic DNA. For each Universal Adaptor pair, the PCR priming 
region contains a 5' four-base overhang and a blunt-ended 3' Key region. Directionality 
is achieved as the blunt-end side of the Universal Adaptor will ligate to the blunt-ended 
DNA fragment while the 5' overhang of the adaptor cannot ligate to the blunt-ended 
DNA fragment. Additionally, a 5' Biotin is added to the Universal Adaptor B to allow 

30 subsequent isolation of ssDNA template (Step 8). Each Universal Adaptor is prepared by 
annealing, in a single tube, the two single-stranded complementary DNA 
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oligonucleotides (i.e., one oligo containing the sense sequence and the second oligo 
containing the anti-sense sequence). 

1. In a 0.2 ml tube, 25 ^1 digested, polished DNA Library, 75 |il 2X Quick Ligase 

Reaction Buffer, 59 \x\ MMP1 (10 pm/(il) adaptor set, 15:1 ratio, and 16 ^1 Quick 
Ligase were added in order. 

2. The ligation reaction was mixed well and incubated at RT for 20 minutes. 



10 3. The ligation reaction was then removed and a 1 0-^iI aliquot of the ligation 

reaction was purified for use on the BioAnalyzer. A single spin column from the 
Qiagen Min-Elute kit was used. The column was eluted with 10 jil EB according 
to the procedure per manufacturers' protocol. A aliquot of the purified 
ligation reaction was loaded using a BioAnalyzer DNA 1000 LabChip. 

15 

This purification step is necessary as the unpurified ligation reaction contains high 
amounts of salt and PEG that will inhibit the sample from running properly on the 
BioAnalyzer. 

20 4. The remainder of the ligation reaction (165 \iL) was used for gel isolation in Step 
4. 

Adaptors 



25 Adaptor "A" (IOjiM) 

The Adaptors (A and B) were HPLC-purified and modified with phosphorothioate 
linkages prior to use. 10 nl of 100 \M Adaptor A (44bp, sense), 10 jil of 100 nM 
Adaptor A (40 bp, anti), and 30 |il of IX Annealing Buffer (V f = 50 were mixed. The 
primers were annealed using the ANNEAL program on the Sample Prep Lab thermal 
30 cycler. Adaptor sets can be stored at -20°C until use. 



Adaptor "B" (10fiM) 

10 \i\ of 100 nM Adaptor B (40bp, sense), 10 \i\ of 100 Adaptor B (44 bp, 
anti), and 30 ^1 of IX Annealing Buffer (Vf = 50 jil) were mixed. The primers were 
35 annealed using the ANNEAL program on the Sample Prep Lab thermal cycler. Adaptor 
sets may be stored at -20°C until use. 
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Adaptor Annealing Program 

ANNEAL-A program for primer annealing: 

/. 95°C, I mm 
2. tol5°C,at O.rC/sec 
5 3. 15°C, forever 

Adaptor Design 

Universal Adaptor Design: 
10 Adaptor A DNA fragment 

Adaptor B [PCR primer (20bp)- Sequencing Primer (20bp)- key (4bp)] — 
DNA Fragment— [key (4bp)-Sequencing Primer (20bp)-PCR primer (20bp)] 

There is no relationship between the orientation of the genomic DNA insert 
15 fragment and the adaptors. Fragments can be ligated in both directions. 

Representative Universal Adaptor Sequence Pair: 
Adaptor A 

20 MMP1A MMP7A key UNA fragment 

5' cgtttcccctgtgtgccttg-ccatctgttccctccctgtc-atgc- 3'--- 5' -gcat-gacacgcaacaggggatagg-qacacqcacgcaacaq 3' 

3' aggggacacacggaac-ggtagacaagggagggacag-tacg- S' 3' -cgta-ctgtgcgttgtcccctatcc-ctgtgcgtgcgttgtctacc-Blotin 5' 

key MMP2B MMPIB 

Adaptor B 

25 

Step 4: Gel Electrophoresis and Extraction of Adapted DNA Library 

The Universal Adaptor ligation protocol will result in the following: 1) 
fragmented DNAs with adaptors on either end; 2) unbound single adaptors; or 3) the 
formation of adaptor dimers. Agarose gel electrophoresis is used as a method to separate 

30 and isolate the adapted DNA library population from the unligated, single adaptors and 
adaptor dimer populations. The procedure of DNase 1 digestion of genomic DNA yields 
a library population that ranges from 50-700bp (Step 1). The addition of the 88-bp 
Universal Adaptor set will shift the population to a larger size and will result in a 
migration profile in the size range of approximately 130-800 bp. Adaptor dimers will 

35 migrate at 88-bp and adaptors unligated will migrate at 44-bp. Therefore, genomic DNA 
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10 



libraries in size ranges >200bp can be physically isolated from the agarose gel and 
purified using standard gel extraction techniques. Gel isolation of the adapted DNA 
library will result in the recovery of a library population in a size range that is >200bp 
(size range of library can be varied depending on application). 

1 . A 2% agarose gel was prepared. 

2. 17 fil of 10X Ready-Load Dye was added to the remaining 165 |il of DNA 
ligation. 

3. All of the dye/ligation reaction was loaded into the gel using four adjacent lanes 
(45.5 nl per lane). 



4. 10 ^1 of thelOO-bp ladder (0.1 |ig/n0 was loaded two lanes away from ligation 
15 reaction lanes. 

5. The gel was run at 100V for 3 hours. 

6. When gel run was complete, the gel was removed from the gel box and the gel 
20 was transferred to a flat surface covered with plastic wrap. DNA bands were 

visualized using a hand-held long-wave UV light. Using a sterile, single-use 
scalpel the library was cut out from agarose gel with fragment sizes between 200 
-400 bp. 

25 Libraries in any size can be isolated. It is also possible to isolate more than one 

size range. Because the library size range is between 200-900bp > it is possible to 
isolate several size ranges from a single well (i.e., 200-400 bp and 500-700 bp). 

7. The agarose-embedded DNA library was isolated using a Qiagen MinElute Gel 
30 Extraction kit. Two MinElute spin columns were used per gel slice (for each size 

range) and each was eluted with 10 |iL EB. Pool eluates were used to produce 20 
|il of DNA library for each size range. 

8. A 1 [iL aliquot of each isolated DNA library was run using a BioAnalyzer DNA 
35 1 000 LabChip to assess the exact distribution of the DNA library population. 

Step 5: Strand Displacement and Extension of Nicked Double-Standed DNA 
Library 

Because the DNA oligonucleotides used for the Universal Adaptors are not 
40 phosphorylated, gaps will be present at the 3' junctions of the fragmented DNAs. These 
two u gaps" or "nicks" can be filled in by using a strand displacing DNA polymerase that 
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lacks 3'->5' exonuclease activity but exhibits 5*->3' exonuclease activity. The 
polymerase will therefore recognize nicks, displace the nicked strands, and extend the 
strand in a manner that results in the removal of the nicks. The strand displacing enzyme, 
Bst I DNA polymerase, is used repair the nicked double-stranded DNA and results in 
5 unnicked double-stranded DNA, 



10 



In a 0.2 ml tube 19 \xl gel-extracted DNA library, 40 ^1 nH 2 0, 8 \i\ 10X ThermoPol 
Reaction Buffer, 8 ^il BSA (1 mg/ml), 2 |al dNTPs (10 mM), and 3 \il Bst I Polymerase 
(8 U/nl) were added in order. 

The samples were mixed well and placed in a thermal cycler and incubated using the 
Strand Displacement incubation program: "BST". 



Strand displacement Program 

1 5 BST program for stand displacement and extension of nicked double-stranded 

DNA: 

7. 65 9 C 30 minutes 

2. 80* C, 10 minutes 

3. 58* C, 10 minutes 
20 4. 14* C, forever 

3. A 1 \xL aliquot of the Bst-treated DNA library was run using a BioAnalyzer DNA 
1000 LabChip. 

Step 6: Preparation of Strepavidin Beads 
25 Following the generation of unnicked double-stranded genomic DNA, it is 

necessary to isolate single-stranded genomic DNAs containing flanking Universal 
Adaptor sequences. This step outlines the binding of biotin-tagged double-stranded DNA 
to streptavidin beads. 



30 100 \i\ Dynal M-270 Streptavidin beads was washed two times with IX Binding 

Buffer (1 M NaCl, 0.5 mM EDTA, 5 mM Tris, pH 7.5) by applying the magnetic beads 
to the MPC. 

I. The beads were resuspended in 1 00 \xl 2X binding buffer, then the remaining 79 
35 jit of the Bst-treated DNA sample (from Step 5) and 20 |il water was added. 
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2. The bead solution was mixed well and placed on a tube rotator at RT for 20 
minutes. The bead mixtures were washed, using the MPC, two times with 100 |il 
oflX Binding Buffer, then washed two times with nH 2 0. 

5 Binding & Washing (B&W) Buffer (2X and IX) 

2X B&W buffer was prepared by mixing 10 mM Tris*HCl (pH 7.5), 1 mM 
EDTA, and 2 M NaCl. The reagents were combined as listed above and mixed 
thoroughly. The solution can be stored at RT for 6 months. 

10 IX B&W buffer was prepared by mixing 2X B&W buffer with nH 2 0, 1:1. The 

final concentrations were half the above: 5 mM Tris^HCl (pH 7.5), 0.5 mM EDTA, and 1 
M NaCl. 

Step 7: Isolation of single-stranded DNA Library using Streptavidin Beads 
15 Following binding of the double-stranded DNA library to streptavidin beads, it is 

necessary to isolate from the ligated pool only the single-stranded DNAs containing 
Universal Adaptor A and Universal Adaptor B (desired populations are designated below 
with asterisks). Double-stranded genomic DNA fragment pools will have adaptors bound 
in the following possible configurations: 

20 

Universal Adaptor A- DNA Fragment 
Universal Adaptor B- DNA Fragment 
Universal Adaptor A- DNA Fragment 
Universal Adaptor B- DNA Fragment 

25 

Because only the Universal Adaptor B has a 5' Biotin moiety, magnetic 
streptavidin-containing beads can be used to bind all DNA library species that possess the 
Universal Adaptor B. Genomic library populations that contain two Universal Adaptor A 
species will not bind to streptavidin-containing beads and will be removed during the 
30 wash procedure. The species that will remain bound to bead after washing will be those 
with Universal Adaptors A and B or those with two Universal Adaptor B ends. Genomic 
DNA species with two Universal Adaptor B sequences (with two biotin molecules) will 



.-Universal Adaptor A 
-Universal Adaptor A* 
.-Universal Adaptor B* 
-Universal Adaptor B 
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be bound to the streptavidin-containing beads on both ends (i.e. both polynucleotides 
comprised in the double stranded molecule will be bound). Species with A and B 
adaptors (only a single biotin molecule) will be bound to the beads only by one end (i.e. 
only one of the polnucleotides comprised in the double stranded molecule will actually be 
5 bound). To isolate the single-stranded population, the bead-bound double-stranded DNA 
is treated with a low salt, hydrogen peroxide solution that will serve to disrupt the 
hydrogen bonding between the complementary DNA strands. If the DNA fragment has 
biotin on each end (Universal Adaptor B ends), the resulting single strands will each 
remain bound to the beads. If the fragment has only a single biotin (Universal Adaptors 

10 A and B) then the complementary strand lacking the biotin moiety will be able to 

separate from the DNA-bead complex. This population of DNA molecules can then be 
subjected to amplification and sequencing as desired. For example, the resulting single- 
stranded genomic DNA library is collected from solution and can be quantitated using 
pyrophosphate sequencing (PyroSequencer, Pyrosequencing MA), real time quantitative 

15 PCR (TaqMan, Applied Biosystems, CA) or capillary gel electrophoresis (RNA LabChip, 
Agilant, CA). Single-stranded genomic DNA libraries are quantitated by calculating the 
number of molecules per unit volume. Single-stranded DNA molecules are then 
annealed (at one copy per bead) to 30-jiicron sepharose beads containing covalently 
attached DNA capture primers (PCR primer sequence B) and amplified using polymerase 

20 chain reaction protocols. Subsequent sequencing is conducted using techniques know to 
one of skill in the art. 



25 



1. To washed beads from Step 6 above, 250 jil Melt Solution was added (0.125 M 
NaOH,0.1 MNaCl). 

2. The bead solution was mixed well and the bead mixture was incubated at room 
temperature for 10 minutes on a tube rotator. 



3. A Dynal MPC was used, the pellet beads were carefully removed and the 
30 supernatant was saved. The 250-(il supernatant contained the single-stranded 

DNA library. 

4. In a separate tube, 1250 jil PB (from QiaQuick Purification kit) was added and 
this solution was neutralized by adding 9 jal of 20% acetic acid. The solution was 

35 mixed well and added to the 250-j.il single-stranded DNA library solution. 
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5. The 1500 fj.1 solution was purified using a single QiaQuick purification spin 
column (load sample through same column two times at 750 \x\ per load). The 
single-stranded DNA library was eluted with 50 |nl EB. 

Melt Solution 

The Melt Solution is 100 mM NaCl, and 125 mM NaOH. These reagents listed 
were combined and mixed thoroughly. The solution can be stored at RT for 6 months. 

1.25 M Acetic Acid 

Add glacial acetic acid to the water, not the contrary (water in acid, morbid; acid 
in water, safer). The solution can be stored at RT for 6 months. 

Step 8: Single-stranded DNA Quantitation using Pyrophosphate Sequencing 
In a 0.2 ml tube, 25 jil single-stranded DNA, 1 jil MMP2B sequencing primer, 
and 14 \i] Annealing Buffer reagents were added in order. 

1. The reaction was allowed to anneal using the ANNEAL-Seq Program. 

2. The reactions were run on a Pyrosequencing apparatus (Pyrosequencing AB, 
Uppsala, Sweden) to determine picomoles of template in sample. The remaining 
25 nL of prepared single-stranded DNA library was used for amplification and 
subsequent sequencing (approximately lxlO 6 reactions). Methods of sequencing 
can be found in U.S. Patent 6,274,320; U.S. Patent 4,863,849; U.S. Patent 
6,210,891; and U.S. Patent 6,258,568, the disclosures of which are incorporated in 
toto herein by reference. The number of single-stranded DNA template molecules 
per microliter was then determined. 

Sequence Primer Annealing Program 

ANNEAL-S program for sequence primer annealing: 

/. 65°C 5 min 

2. to50°Cat 0.1 o C/sec 

3. to 50°C, 1 min 

4. to 40°Cat O.rC/sec 

5. to 40°C, 1 min 
6.lo5°Q atO.rC/sec 
7. 15°C forever 

Step 9: Dilution and Storage of Single-Stranded DNA library 
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Single-stranded DNA library in EB buffer: Remaining final volume = 25 \xl 

• Initial Stock dilution : Using Pyrosequencing (Pyrosequencing AB, Uppsala, 
Sweden) results, dilute single-stranded DNA library to 100M molecules/jaL in IX 
Annealing Buffer (usually this is a 1:50 dilution). 

5 

• Aliquots of single-stranded DNA Library for common use : Dilute to 200,000 
molecules/jaL in IX Annealing Buffer and prepare 30 jiL aliquots. Store at ~20°C. 

• Samples to be utilized in emulsion PCR. 

10 

Reagent preparation 

Stop Solution (50 mM EDTA) 

100 \il of 0.5 M EDTA was mixed with 900 jil of nH 2 0 to make 1.0 ml of 50 mM 
15 EDTA solution. 

IQmMdNTPs (10 \xl dCTP (100 mM), 10 nl dATP (100 mM), 10 \il dGTP (100 
mM), and 10 yl dTTP (100 mM), 60 ^il Molecular Biology Grade water, (nH 2 0)) 

All four 100 mM nucleotide stocks were thawed on ice. 10 (il of each nucleotide 
was combined with 60 \il of nH 2 0 to a final volume of 100 and mixed thoroughly. 1 
20 ml aliquots were dispensed into 1,5 ml microcentrifuge tubes, and stored at -20°C, no 
longer than one year. 

Annealing buffer, 10X 

10 X annealing buffer is 200 mM Tris (pH 7.5) and 50 mM magnesium acetate. 
25 24.23 g Tris was added to 800 ml nH20 and adjust to pH 7.5. To this solution 10.72 g 
was added to magnesium acetate and dissolved completely. The solution was brought up 
to a final volume of 1000 ml. The solution can be stored at 4°C for 1 month. 

lOx TE 

10 X TE is 100 mM Tris'HCl (pH 7.5), and 50 mM EDTA. These reagents were 
30 added together and mixed thoroughly. The solution can be stored at R.T. for 6 months. 
Example 2: Binding Nucleic Acid Template to Capture Beads 
This example describes preparation of a population of beads that preferably have 
only one unique nucleic acid template attached thereto. Successful clonal amplification 
depends on the delivery of a controlled number of template species (0.5 to 1) to each 
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bead. Delivery of excess species can result in PCR amplification of a mixed template 
population, preventing generation of meaningful sequence data. 

600,000 DNA capture beads, in a liquid solution, was transferred to a 1.5 ml tube. 
The suspension solution was removed by centrifiigation and removal of the supernatant. 
5 The beads were washed twice with annealing buffer and resuspended in 100 fil of the 
same buffer. The resuspended beads were transferred to a PCR reaction tube and all but 
about 10 ^il of the buffer was removed. Six \x\ of diluted sstDNA solution comprising 
200,000 DNA molecules/ul were added and the tubes were gently mixed. The DNA were 
annealed to the capture beads by the following thermocycler schedule: 
10 5 minute @ 65 °C 

Decrease by 0. 1 °C /sec to 60 °C 

Hold at 60 °C for i minute 

Decrease by 0. 1 °C /sec to 50 °C 

Hold at 50°Cfor 1 minute 
1 5 Decrease by 0. 1 °C /sec to 40 °C 

Hold at 40 oC for 1 minute 

Decrease by 0.1 °C /sec to 20 °C 

Hold at 10°C until ready for the next step. 

This thermocycling program is referred to herein as the "thermocycler annealing 
20 program.'* 

The resultant beads were centrifuged and the supernatant was removed. The 
capture beads had, on average, 0.5 to 1 copies of sstDNA bound to each bead, and were 
ready for emulsifying. 

25 Example 3: Emulsification 

This example describes how to creating a heat-stable water-in-oil emulsion 
containing about 3,000 PCR microreactors per microliter. Outlined below is a protocol 
for preparing the emulsion. 

200 |il of PCR solution were added to 600,000 beads (as prepared in Example 1) 
30 and incubated at room temperature for 2 minutes. 400 |il of emulsion oil were added to a 
2 ml micro centrifuge tube along with a l A inch magnetic stir bar. The stir bar was spun 
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at 600 rpm while the beads were added dropwise into the emulsion oil solution until the 
solution became emulsified into a homogeneous milky white liquid with the viscosity of 
mayonnaise. Stirring was continued for another 5 minutes. 

A PCR solution suitable for use in this invention is described below. For 200 fj.1 
5 PCR reaction mix (enough for amplifying 600K beads), the following were added to a 0.2 
ml PCR tube: 





Stock 


Final 


Microliters 


HIFI Buffer 


10X 


1 X 


20 


treated Nukes 


10 mM 


1 mM 


20 


Mg 


50 mM 


2mM 


8 


BSA 


10% 


0.1 % 


2 


Tween 80 


1 % 


0.01 % 


2 


Ppase 


2U 


0.003 U 


0.333333 


Primer MMP1 a 


100 uM 


0.625 uM 


1.25 


Primer MMP1b 


10 uM 


0.078 uM 


1.56 


Taq 


5U 


0.2 U 


8 


Water 






136.6 


Total 






200 



The components of a suitable emulsion oil mixture (with emulsion stabilizers) is 
10 described below. 



Ingredient 


Quantity 
Required 


Source 


Ref. Number 


Sigma Light Mineral Oil 


94.5 grams 


Sigma 


M-5904 


Allox4912 


1 gram 


Uniqema 


NA 


Span 80 


4.5 grams 


Umqema 


NA 



The emulsion oil mixture was made by prewarming the Atlox 4912 to 60°C in a 
water bath. Then, 4.5 grams of Span 80 was added to 94.5 grams of mineral oil to form a 
15 mixture. Then one gram of the prewarmed Atlox 4912 was added to the mixture. The 
solutions were placed in a closed container and mixed by shaking and inversion. Any 
sign that the Atlox was settling or solidifying was remedied by wanning the mixture to 
60°C followed by more shaking. 

20 Example 4: Amplification 

This example describes amplifying the template DNA in the bead - emulsion 
mixture. According to this protocol, the DNA amplification phase of the process, 
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according to the present invention, takes 3 to 4 hours. After the amplification is complete, 
the emulsion may be left on the thermocycler for up to 12 hours before the process of 
isolating the beads is begun. 

PCR thermocycling was performed by placing 50 to 100 |il of the emulsified 
5 reaction mix into individual PCR reaction chambers (i.e., PCR tube). PCR was 
performed as follows: 

lx (4 minutes @ 94o C) - Hotstart Initiation 

40x (30 seconds @ 94° C, 30 seconds @ 58° C, 90 seconds @ 68° C) 

25x (30 seconds @ 94 ° C, 3 minutes at 58 0 C) 
10 Storage at 14°C 

Example 5: Breaking the Emulsion and Bead Recovery 
This example describes how to break the emulsion and recover the beads with 
amplified template thereon. The post-PCR emulsion should remain intact. The lower 

15 phase of the emulsion should, by visual inspection, remain a milky white suspension. If 
the solution is clear, the emulsion may have partially resolved into its aqueous and oil 
phases. If the emulsion is broken, it is likely that many of the beads will have a mixture 
of templates - these preparations should be discarded if it is suspected that the emulsion 
broke during the PCR process. 

20 The PCR reactions were combined in a single tube and 500 jil of Sigma Light 

Mineral were added to the tube. The sample was vortexed for 30 seconds and the tube 
was centrifuged at 13.2 K for 20 minutes in an Eppendorf centrifuge. The organic phase 
(oil) was removed and replaced with 1 ml of hexane. The tube was vortexed for 1 minute 
and centrifuged for 5 minutes. The hexane wash was repeated 2 times. The beads were 

25 collected after the wash. 

Example 6: Single Strand Removal and Primer Annealing 

If the beads are to be used in a pyrophosphate-based sequencing reaction, then it 
is necessary to remove the second strand of the PCR product and anneal a sequencing 
30 primer to the single stranded template that is bound to the bead. This example describes a 
protocol for accomplishing that. 
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The beads were washed in 1 ml of water, 1 ml of 1 mM EDTA. Then the beads 
were placed in 1 ml of 0. 1 25 M NaOH for 8 minutes. The beads were washed twice with 
1 ml of annealing buffer. Then the beads were resuspended in 100 \x) of annealing buffer 
and 4 |il of 100 uM sequencing primer. The sample was vortexed and annealed using the 
5 thermocycing program of Example 3. Following annealing, the beads were washed 3 
times in annealing buffer and resuspended in 100 |il of annealing buffer. 

Example 7: Optional Enrichment Step 

The beads may be enriched for amplicon containing bead using the following 

10 procedure. Enrichment is not necessary but would subsequent molecular biology 
techniques, such as DNA sequencing, more efficient. 

50 (al of 10 uM (total 500 pmoles) of biotin-sequencing primer was added to the 
Sepharose beads containing amplicons from the previous example. The beads were 
placed in a thermocycler. The primer was annealed to the DNA on the bead by the 

15 thermocycler annealing program of Example 2. 

After annealing, the sepharose beads were washed three times with annealing 
buffer containing 0.1% Tween 20. The beads, now containing ssDNA fragments 
annealed with biotin-sequencing primers, were concentrated by centrifugation and 
resuspended in 200 \i\ of BST binding buffer. 10 jal of 50,000 unit/ml Bst-polymerase 

20 was added to the resuspended beads and the vessel holding the beads was placed on a 
rotator for five minutes. 2 |il of lOmM dNTP mix (i.e., 2.5 |al each of 10 mM dATP, 
dGTP, dCTP and dTTP) was added and the mixture was incubated for an additional 10 
minutes at room temperature. Then the beads were washed three times with annealing 
buffer containing 0.1% Tween 20 and resuspended in the original volume of annealing 

25 buffer. 

50 |il of Dynal Streptavidin beads (Dynal Biotech Inc., Lake Success, NY; M270 
or MyOne™ beads at 10 mg/ml) was washed three times with annealing buffer 
containing 0.1% Tween 20 and resuspended in the original volume in annealing buffer 
containing 0.1% Tween 20. Then the Dynal bead mixture was added to the resuspended 
30 sepharose beads. Following vortexing, the new mixture was placed in a rotator for 10 
minutes at room temperature. 
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The beads were collected on the bottom of the test tube by centri (ligation at 2300 
g (500 rpm for Eppendorf Centrifuge 541 5D) and resuspended in the original volume of 
annealing buffer containing 0. 1% Tween 20. The mixture, in a test tube, was placed in a 
magnetic separator (Dynal). The beads were washed three times with annealing buffer 
5 containing 0.1% Tween 20 and resuspended in the original volume in the same buffer. 
At this stage, beads without amplicons should have been removed by the previous wash 
steps. Only Sepharose beads containing the appropriated DNA fragments should be 
retained. 

The magnetic beads were separated from the sepharose beads by addition of 500 
10 \x\ of 0.125 M NaOH. The mixture was vortexed and the magnetic beads were removed 
by magnetic separation. The Sepharose beads remaining in solution was transferred to 
another tube and washed with 400 jal of 50 mM Tris Acetate until the pH is stabilized at 
7.6. 



15 Example 8: Nucleic Acid Sequencing Using Bead Emulsion PCR 

The following experiment was performed to test the efficacy of the bead emulsion 
PCR. 600,000 Sepharose beads, with an average diameter of 25-35 um (as supplied my 
the manufacturer) was covalently attached to capture primers at a ratio of 30-50 million 
copy per bead. The beads covalently attached captured primers were mixed with 1.2 

20 million copies of single stranded Adenovirus Library (containing an end that is 
complimentary to the captured primer on the beads). 

The adenovirus library was annealed to the beads using the procedure described in 
Example 1 . Then the beads were resuspended in complete PCR solution. The PCR 
Solution and beads were emulsified in 2 volumes of spinning emulsification oil using the 

25 same procedure described in Example 2. The emulsified (encapsulated) beads were 
subjected to amplification by PCR as outlined in Example 3. The emulsion was broken 
as outlined in Example 4. DNA on beads was rendered single stranded, sequencing 
primer was annealed using the procedure of Example 5. 

70,000 beads were sequenced simultaneously by pryophosphate sequencing using 

30 a pyrophosphate sequencing apparatus as decribed herein (New Haven, CT). Multiple 
batches of 70,000 beads were sequenced and the data were listed in the table below. 
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Alignment Error 


Alignments 




Inferred 


Tolerance 


None 


Single 


Multiple 


Unique 


Coverage 


Read Error 


0% 


47916 


1560 




1110 


54.98% 


0.00% 


5% 


46026 


3450 




2357 


83.16% 


1.88% 


10% 


43474 


6001 


1 


3742 


95.64% 


4.36% 



The Table above shows the results of running a Blast program of the sequences 
obtained from the apparatus of this invention against the known Adenovirus sequence. 
5 The first column is the error tolerance used in the Blast program. The last column is the 
real error as determined by direct comparison to the known sequence. 

Example 9: Template PCR 

30 micron NHS Sepharose beads were coupled with ImM of each of the 
1 0 following primers: 

MMP 1 A: cgtttcccctgtgtgccttg (SEQ ID NO: 1 ) 
MMP1B: ccatctgttgcgtgcgtgtc (SEQ ID NO:2) 

Drive-to-bead PCR was performed in a tube on the MJ thermocyclcr by adding 50 
ul of washed primer-coupled beads to a PCR master mix at a one-to-one volume-to- 
15 volume ratio. The PCR master mix was as follows: 

IX PCR buffer 

1 mM of each dNTP. 

0.625 uM primer MMP 1 A 

0.625 uM primer MMP IB 
20 1 ul of 1 unit/ul Hi Fi Taq Invitrogen, San Diego, CA 

- 5-10ng Template DNA (the DNA to be sequenced) 

The PCR reaction was performed by programming the MJ thermocycler as 
follows: 

94°C for 3 minutes. 39 cycles of 94°C for 30 seconds, 58°C for 30 seconds, 68°C 
25 for 30 seconds. Followed by 94°C for 30 seconds and 58°C for 10 minutes. 10 cycles of 
94°C for 30 seconds, 58°C for 30 seconds, 68°C for 30 seconds. Storage at 10°C. 
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Example 10: Template DNA Preparation and Annealing Sequencing Primer 

The beads from Example 1 were washed two times with distilled water; washed 
once with 1 mM EDTA, and incubated with 0.125 M NaOH for 5 minutes to remove the 
DNA strands not linked to the beads. Then the beads were washed once witli 50 mM Tris 
5 Acetate buffer, and twice with Annealing Buffer 200mM Tris- Acetate, 50mM Mg 
Acetate pH 7.5. 

500 pmoles of Sequencing Primer MMP7A (ccatctgttccctccctgtc) (SEQ ID NO:3) 
and MMP2B-phos (cctatcccctvttgcgrgtc) (SEQ ID NO:4) were added to the beads and 
annealed with the following program on the MJ thermocycler. 60°C for 5 minutes. 
10 Temperature drop of 0.1 degree per second to 50°C. 50°C for 5 minutes. 

Temperature drop of 0.1 degree per second to 4°C. 40°C for 5 minutes. Temperature 
drop of 0.1 degree per second to 10°C. The resultant template was sequenced using 
standard pyrophosphate sequencing. 



1 5 Example 1 1 : Sequencing and Stopping of the First Strand 

The beads were spun into a 55 \xm PicoTiter plate (PTP) at 3000 rpm for 10 
minutes. The PTP was placed on a rig and fun de novo sequencing for a predetermined 
number of cycles. The sequencing was stopped by capping the first strand. The first 
strand was capped by adding lOOul of IX AB (50mM Mg Acetate, 250mM Tricine), 
20 lOOOunit/ml BST polymerase, 0.4 mg/ml single strand DNA binding protein, 1 mM DTT, 
0.4 mg/ml PVP (Polyvinyl Pyrolidone), 10 uM of each ddNTP, and 2.5 uM of each 
dNTP. Apyrase was then flowed over in order to remove excess nucleotides by adding 
IX AB, 0.4 mg/ml PVP, 1 mM DTT, 0.1 mg/ml BSA, 0.125 units/ml apyrase, incubated 
for 20 minutes. 

25 

Example 12: Preparation of Second Strand for Sequencing 

The second strand was unblocked by adding 100 ul of 1 X AB, 0. 1 unit per ml 
poly nucleotide kinase, 5 mM DTT. The resultant template was sequenced using 
standard pyrophosphate sequencing (described, e.g., in US patent 6,274,320, 6258,568 
30 and 6,2 1 0,89 1 , incorporated herein by reference). 
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The results of the sequencing method can be seen in Figure 1 OG where a fragment 
of 174 bp was sequenced on both ends using pyrophosphate sequencing and the methods 
described in these examples. 

5 Example 13 Sequence Analysis of Nucleic Acid on a Picotiter Plate 

The picotiter plate containing amplified nucleic acids as described in Example 2 is 
placed in a perfusion chamber. Then sulfurylase, apyrase, and luciferase are delivered to 
the picotiter plate. 

The sequencing primer primes DNA synthesis extending into the insert suspected 
10 of having a polymorphism, as shown in Figure 1 1 . The sequencing primer is first 
extended by delivering into the perfusion chamber, in succession, a wash solution, a 
DNA polymerase, and one of dTTP, dGTP, dCTP, or a thio dATP (a dATP analog). The 
sulfurylase, luciferase, and apyrase, attached to the termini convert any PPi liberated as 
part of the sequencing reaction to detectable light. The apyrase present degrades any 
15 unreacted dNTP. Light is typically allowed to collect for 3 seconds (although 1-100, e.g., 
2-1 0 seconds is also suitable) by a CCD camera linked to the fiber imaging bundle, after 
which additional wash solution is added to the perfusion chamber to remove excess 
nucleotides and byproducts. The next nucleotide is then added, along with polymerase, 
thereby repeating the cycle. 
20 During the wash the collected light image is transferred from the CCD camera to 

a computer. Light emission is analyzed by the computer and used to determine whether 
the corresponding dNTP has been incorporated into the extended sequence primer. 
Addition of dNTPs and pyrophosphate sequencing reagents is repeated until the sequence 
of the insert region containing the suspected polymorphism is obtained. 

25 

Example 14 On Picotiter Plate PCR Amplification 

Picotiter plate Preparation: In a further embodiment, the single stranded library 
attached to beads can be distributed directly onto the picotiter plate and then the nucleic 
acid template on each bead is amplified (using PCR or other known amplification 
30 technology) to generate sufficient copy number of the template that will generate 
detectable signal in the pyrophosphate-based sequencing methods disclosed herein. 
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Example 15 Sequence Analysis of Nucleic Acid on a PTP 

Reagents: Reagents used for sequence analysis and as controls were the four 
nucleotides and 0.1 jaM Pyrophosphate (PPi) were made in substrate solution, where 
5 substrate refers to a mixture of 300 jaM Luciferin and 4 jjM adenosine 5'-phosphosulfate, 
APS, which are the substrates for the cascade of reactions involving PPi, Luciferase and 
Sulfurylase. The substrate was made in assay buffer. The concentration of PPi used to 
test the enzymes and determine the background levels of reagents passing through the 
chamber was 0.1 |iM. The concentration of the nucleotides, dTTP, dGTP, dCTP was 
10 6.5 nM and that of ctdATP was 50 nM. Each of the nucleotides was mixed with DNA 
polymerase, Klenow at a concentration of 100 U/mL. 

The PTP was placed in the flow chamber of the embodied instrument, and the 
flow chamber was attached to the faceplate of the CCD camera. The PTP was washed by 
flowing substrate (3 ml per min, 2 min) through the chamber. Subsequently, a sequence 
15 of reagents was flown through the chamber by the pump connected to an actuator, which 
was programmed to switch positions, which had tubes inserted in the different reagents. 
Table 1 , lists the sequence of reagents, flow rates and flow times. The camera was set up 
in a fast acquisition mode, with exposure time = 2.5s. 

The signal output from the pad is the average of counts on all the pixels within the 
20 pad. The frame number is equivalent of the time passed during the experiment. The 
graph indicates the flow of the different reagents. 

The disclosures of one or more embodiments of the invention are set forth in the 
accompanying description. Although any methods and materials similar or equivalent to 

25 those described herein can be used in the practice or testing of the present invention, the 
preferred methods and materials are now described. Other features, objects, and 
advantages of the invention will be apparent from the description and from the claims. In 
the specification and the appended claims, the singular forms include plural referents 
unless the context clearly dictates otherwise. Unless defined otherwise, all technical and 

30 scientific terms used herein have the same meaning as commonly understood by one of 
ordinary skill in the art to which this invention belongs. Unless expressly stated 
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otherwise, the techniques employed or contemplated herein are standard methodology 
well known to one of ordinary skill in the art. The examples of embodiments are for 
illustration purposes only. All patents and publications cited in this specification are 
incorporated by reference. 
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CLAIMS 

WE CLAIM: 

1. An array comprising a planar surface with a plurality of cavities thereon, 
each cavity forming an analyte reaction chamber, wherein the reaction chambers have a 
center to center spacing of between 20 to 100 urn and each cavity has a width in at least 
one dimension of between 20 pm and 70 pm, and wherein there are at least 10,000 
reaction chambers. 

2. The array of claim 1 wherein a plurality of reaction chambers contains at 
least 100,000 copies of a single species of single stranded nucleic acid template. 

3. The array of claim 2 wherein the single stranded nucleic acid templates are 
immobilized on mobile solid supports disposed in the reaction chambers. 

4. The array of claim 1 wherein the center to center spacing is between 40 to 

60 pm. 

5. The array of claim 1, wherein each cavity has a depth of between 20 urn 
and 60 pm. 

6. An array comprising a planar top surface and a planar bottom surface 
wherein the planar top surface has at least 10,000 cavities thereon, each cavity forming an 
analyte reaction chamber, the planar bottom surface is optically conductive such that 
optical signals from the reaction chambers can be detected through the bottom planar 
surface, wherein the distance between the top surface and the bottom surface is no greater 
than 5 mm, wherein the reaction chambers have a center to center spacing of between 20 
to 100pm and each chamber having a width in at least one dimension of between 20pm 
and 70pm. 

7. The array of claim 6, wherein the distance between the top surface and the 
bottom surface is no greater than 2 mm. 

8. The array of claim 1 or 6, wherein the number of cavities is greater than 

50,000. 

9. The array of claim 1 or 6, wherein the number of cavities is greater than 
100,000. 
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10. The array of claim 1 or 6, wherein the shape of each reaction chamber is 
substantially hexagonal. 

11. The array of claim 1 or 6, wherein each cavity has at least one irregular 
wall surface. 

12. The array of claim 1 or 6, wherein the array is formed in a fused fiber 
optic bundle. 

13. The array of claim 1 or 6, wherein each cavity has a smooth wall surface. 

14. The array of claim 1 or 6, wherein the cavities are formed by etching one 
end of the fiber optic bundle. 

15. The array of claim 1 or 6, wherein each cavity contains reagents for 
analyzing a nucleic acid or protein. 

16. The array of claim 1 or 6 further comprising a second surface spaced apart 
from the planar array and in opposing contact therewith such that a flow chamber is 
formed over the array. 

17. An array means for carrying out separate parallel common reactions in an 
aqueous environment, wherein the array means comprises a substrate comprising at least 
10,000 discrete reaction chambers containing a starting material that is capable of 
reacting with a reagent, each of the reaction chambers being dimensioned such that when 
one or more fluids containing at least one reagent is delivered into each reaction chamber, 
the diffusion time for the reagent to diffuse out of the well exceeds the time required for 
the starting material to react with the reagent to form a product. 

18. The array of claim 1 7, wherein each cavity contains reagents for analyzing 
a nucleic acid or protein. 

19. The array of claim 17 further comprising a population of mobile solid 
supports disposed in the reaction chambers, each mobile solid support having one or 
more bioactive agents attached thereto. 

20. The array of claim 17, wherein the cavities are formed in the substrate via 
etching, molding or micromaching. 
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21 . The array of claim 1 7, wherein the substrate is a fiber optic bundle. 

22. The array of claims I, 6 or 17, wherein at least 5% to 20% of the reaction 
chambers contain at least one mobile solid support having at least one reagent 
immobilized thereon. 

23. The array of claims 1 , 6 or 17, wherein at least 20% to 60% of the reaction 
chambers have at least one mobile solid support having at least one reagent immobilized 
thereon. 

24. The array of claims 1, 6 or 17, wherein at least 50% to 100% of the 
reaction chambers have at least one mobile solid support having at least one reagent 
immobilized thereon. 

25. The array of claim 22, wherein the reagent immobilized on the mobile 
solid support is a polypeptide with sulfurylase activity. 

26. The array of claims 22, wherein the reagent immobilized on the mobile 
solid support is a polypeptide with luciferase activity. 

27. The array of claims 22, wherein the mobile solid support has both 
sulfurylase and luciferase immobilized. 

28. The array of claim 22 wherein a plurality of reaction chambers contains at 
least 100,000 copies of a single species of single stranded nucleic acid template. 

29. The array of claim 22 wherein the single stranded nucleic acid templates 
are immobilized on mobile solid supports disposed in the reaction chambers. 

30. The array of claim 2 or 28, wherein the nucleic acid is suitable for use in a 
pyrosequencing reaction. 

31. A method for delivering a bioactive agent to an array, comprising 
dispersing over the array a plurality of mobile solid supports, each mobile solid support 
having at least one reagent immobilized thereon, wherein the reagent is suitable for use in 
a nucleic acid sequencing reaction, where the array comprises a planar surface with a 
plurality of reaction chambers disposed thereon, wherein the reaction chambers have a 
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center to center spacing of between 20 to 100 pm and each reaction chamber has a width 
in at least one dimension of between 20 pm and 70 pm. 

32, An apparatus for simultaneously monitoring an array of reaction chambers 
for light indicating that a reaction is taking place at a particular site, the apparatus 
comprising: 

(a) an array of reaction chambers formed from a planar substrate comprising a 
plurality of cavitated surfaces, each cavitated surface forming a reaction chamber adapted 
to contain analytes, and wherein the reaction chambers have a center to center spacing of 
between 20 to 100 urn, each reaction chamber having a volume of between 10 to 150 pL, 
the array comprising more than 10,000 discrete reaction chambers; 

(b) an optically sensitive device arranged so that in use the light from a 
particular reaction chamber will impinge upon a particular predetermined region of said 
optically sensitive device; 

(c) means for determining the light level impinging upon each of said 
predetermined regions and 

(d) means to record the variation of said light level with time for each of said 
reaction chamber 

33. An analytic sensor, comprising: 

(a) an array formed from a first bundle of optical fibers with a plurality of 
cavitated surfaces at one end thereof, each cavitated surface forming a reaction chamber 
adapted to contain analytes, and wherein the reaction chambers have a center to center 
spacing of between 20 to 100 pm, a width of 20 to 70 pm, the array comprising more 
than 10,000 discrete reaction chambers; 

(b) an enzymatic or fluorescent means for generating light in the reaction 
chambers; 

(c) light detection means comprising a light capture means and a second fiber 
optic bundle for transmitting light to the light detecting means, the second fiber optic 
bundle being in optical contact with the array, such that light generated in an individual 
reaction chamber is captured by a separate fiber or groups of separate fibers of the second 
fiber optic bundle for transmission to the light capture means. 
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34. The sensor of claim 33, wherein said sensor is suitable for use in a 
biochemical assay. 

35. The sensor of claim 33, wherein said sensor is suitable for use in a cell- 
based assay. 

36. The sensor of claim 33, wherein the light capture means is a CCD camera. 

37. The sensor of claim 33, wherein the reaction chambers contain one or 
more mobile solid supports with a bioactive agent immobilized thereon. 

38. A method for canying out separate parallel common reactions in an 
aqueous environment, comprising: 

(a) delivering a fluid containing at least one reagent to an array, wherein the 
array comprises a substrate comprising at least 10,000 discrete reaction chambers, each 
reaction chamber adapted to contain analytes, and wherein the reaction chambers have a 
volume of between 10 to 150 pL and containing a starting material that is capable of 
reacting with the reagent, each of the reaction chambers being dimensioned such that 
when the fluid is delivered into each reaction chamber, the diffusion time for the reagent 
to diffuse out of the well exceeds the time required for the starting material to react with 
the reagent to form a product; and 

(b) washing the fluid from the array in the time period (i) after the starting 
material has reacted with the reagent to form a product in each reaction chamber but 
(ii) before the reagent delivered to any one reaction chamber has diffused out of that 
reaction chamber into any other reaction chamber. 

39. The method of claim 38 wherein the product formed in any one reaction 
chamber is independent of the product formed in any other reaction chamber, but is 
generated using one or more common reagents 

40. The method of claim 38 wherein the starting material is a nucleic acid 
sequence and at least one reagent in the fluid is a nucleotide or nucleotide analog 

41. The method of claim 38 wherein the fluid additionally comprises a 
polymerase capable of reacting the nucleic acid sequence and the nucleotide or nucleotide 
analog 
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42. The method of claim 38 additionally comprising repeating steps (a) and 
(b) sequentially. 

43. A method for delivering nucleic acid sequencing enzymes to an array, said 
array having a planar surface with a plurality of cavities thereon, each cavity forming an 
analyte reaction chamber, wherein the reaction chambers have a center to center spacing 
of between 20 to 100 ^m; the method comprising dispersing over the array a plurality of 
mobile solid supports having one or more nucleic acid sequencing enzymes immobilized 
thereon, such that a plurality of the reaction chambers contain at least one mobile solid 
support. 

44. The method of claim 43 wherein one of the nucleic acid sequencing 
enzymes is a polypeptide having sulfurylase activity, luciferase activity or both. 

45. A method for delivering a plurality of nucleic acid templates to an array, 
said array having a planar surface with a plurality of cavities thereon, each cavity forming 
an analyte reaction chamber, wherein the reaction chambers have a center to center 
spacing of between 20 to 100 \im and the array having at least 10,000 reaction chambers; 
the method comprising dispersing over the array a plurality of mobile solid supports, each 
mobile solid support having no more than a single species of nucleic acid template 
immobilized thereon, the dispersion causing no more than one mobile solid support to be 
disposed within any one reaction chamber. 

46. The method of claim 45 wherein the nucleic acid sequence is a single 
stranded nucleic acid. 

47. The method of claim 45 wherein at least 100,000 copies of a single species 
of nucleic acid template are immobilized on a plurality of the mobile solid supports. 

48. The method of claim 45 wherein each single species of nucleic acid 
template is amplified on a picotiter plate to produce at least 2,000,000 copies per well of 
said nucleic acid template after being disposed in the reaction chamber. 

49. The method of claim 48 wherein the nucleic acid sequence is amplified 
using an amplification technology selected from the group consisting of polymerase 
chain reaction, ligase chain reaction and isothermal DNA amplification. 
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50. A method for sequencing a nucleic acid, the method comprising: 

(a) providing a plurality of single-stranded nucleic acid templates disposed 
within a plurality of cavities on a planar surface, each cavity forming an analyte reaction 
chamber, wherein the reaction chambers have a center to center spacing of between 20 to 
100 nm and the planar surface has at least 1 0,000 reaction chambers; 

(b) performing a pyrophosphate based sequencing reaction simultaneously on 
all reaction chambers by annealing an effective amount of a sequencing primer to the 
nucleic acid templates and extending the sequencing primer with a polymerase and a 
predetermined nucleotide triphosphate to yield a sequencing product and, if the 
predetermined nucleotide triphosphate is incorporated onto the 3' end of said sequencing 
primer, a sequencing reaction byproduct; and 

(c) identifying the sequencing reaction byproduct, thereby determining the 
sequence of the nucleic acid in each reaction chamber. 

51. The method of claim 50, wherein the sequencing reaction byproduct is PPi 
and a coupled sulfiirylase/luciferase reaction is used to generate light for detection. 

52. The method of claim 51, wherein either or both of the sulfiirylase and 
luciferase are immobilized on one or more mobile solid supports disposed at each 
reaction site. 

53. A method of determining the base sequence of a plurality of nucleotides 
on an array, the method comprising: 

(a) providing at least 10,000 DNA templates, each separately disposed within 
a plurality of cavities on a planar surface, each cavity forming an analyte reaction 
chamber, wherein the reaction chambers have a center to center spacing of between 20 to 
100 urn, and a volume of between 10 to 150 pL; wherein 

(b) adding an activated nucleotide S'-triphosphate precursor of one known 
nitrogenous base to a reaction mixture in each reaction chamber, each reaction mixture 
comprising a template-directed nucleotide polymerase and a single-stranded 
polynucleotide template hybridized to a complementary oligonucleotide primer strand at 
least one nucleotide residue shorter than the templates to form at least one unpaired 
nucleotide residue in each template at the 3'-end of the primer strand, under reaction 
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conditions which allow incorporation of the activated nucleoside 5'-triphosphate 
precursor onto the 3' -end of the primer strands, provided the nitrogenous base of the 
activated nucleoside 5' -triphosphate precursor is complementary to the nitrogenous base 
of the unpaired nucleotide residue of the templates; 

(c) detecting whether or not the nucleoside 5 '-triphosphate precursor was 
incorporated into the primer strands in which incorporation of the nucleoside 5'- 
triphosphate precursor indicates that the unpaired nucleotide residue of the template has a 
nitrogenous base composition that is complementary to that of the incorporated 
nucleoside S'-triphosphate precursor; and 

(d) sequentially repeating steps (b) and (c), wherein each sequential repetition 
adds and, detects the incorporation of one type of activated nucleoside 5 '-triphosphate 
precursor of known nitrogenous base composition; and 

(e) determining the base sequence of the unpaired nucleotide residues of the 
template in each reaction chamber from the sequence of incorporation of said nucleoside 
precursors. 

54. A method of identifying the base in a target position in a DNA sequence 
of template DNA, wherein: 

(a) at least 1 0,000 separate DNA templates are separately disposed within a 
plurality of cavities on a planar surface, each cavity forming an analyte reaction chamber, 
wherein the reaction chambers have a center to center spacing of between 20 to 100 |im, 
said DNA being rendered single stranded either before or after being disposed in the 
reaction chambers, 

(b) an extension primer is provided which hybridizes to said immobilized 
single-stranded DNA at a position immediately adjacent to said target position; 

(c) said immobilized single-stranded DNA is subjected to a polymerase 
reaction in the presence of a predetermined deoxynucleotide or dideoxynucleotide, 
wherein if the predetermined deoxynucleotide or dideoxynucleotide is incorporated onto 
the 3 5 end of said sequencing primer then a sequencing reaction byproduct is formed; and 

(d) identifying the sequencing reaction byproduct, thereby determining the 
nucleotide complementary to the base at said target position in each of the 10,000 DNA 
templates. 
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55. The method of claim 54 wherein in place of deoxy- or dideoxy adenosine 
triphosphate (ATP) a dATP or ddATP analogue is used which is capable of acting as a 
substrate for a polymerase but incapable of acting as a substrate for a said PPi — detection 
enzyme. 

56. An apparatus for analyzing a nucleic acid sequence, the apparatus 
comprising: 

(a) a reagent delivery cuvette, wherein the cuvette includes an array 
comprising a planar surface with a plurality of cavities thereon, each cavity forming an 
analyte reaction chamber, wherein the reaction chambers have a center to center spacing 
of between 20 to 100 urn, and there are in excess of 10,000 reaction chambers, and 
wherein the reagent delivery cuvette contains reagents for use in a sequencing reaction; 

(b) a reagent delivery means in communication with the reagent delivery 
cuvette; 

(c) an imaging system in communication with the reagent delivery chamber; 

and 

(d) a data collection system in communication with the imaging system. 

57. An apparatus for determining the base sequence of a plurality of 
nucleotides on an array, the apparatus comprising: 

(a) a reagent cuvette containing a plurality of cavities on a planar surface, 
each cavity forming an analyte reaction chamber, wherein there are in excess of 10,000 
reaction chambers, each having a center to center spacing of between 20 to 100 ^m and a 
volume of between 10 to 150 pL; 

(b) reagent delivery means for simultaneoulsy adding to each reaction 
chamber an activated nucleotide 5 '-triphosphate precursor of one known nitrogenous 
base to a reaction mixture in each reaction chamber, each reaction mixture comprising a 
template-directed nucleotide polymerase and a single-stranded polynucleotide template 
hybridized to a complementary oligonucleotide primer strand at least one nucleotide 
residue shorter than the templates to form at least one unpaired nucleotide residue in each 
template at the 3'-cnd of the primer strand, under reaction conditions which allow 
incorporation of the activated nucleoside S'-triphosphate precursor onto the 3'-cnd of the 
primer strands, provided the nitrogenous base of the activated nucleoside S'-triphosphate 
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precursor is complementary to the nitrogenous base of the unpaired nucleotide residue of 
the templates; 

(c) detection means for detecting in each reaction chamber whether or not the 
nucleoside 5'-triphosphate precursor was incorporated into the primer strands in which 
incorporation of the nucleoside 5 '-triphosphate precursor indicates that the unpaired 
nucleotide residue of the template has a nitrogenous base composition that is 
complementary to that of the incorporated nucleoside 5' -triphosphate precursor; and 

(d) means for sequentially repeating steps (b) and (c), wherein each sequential 
repetition adds and, detects the incorporation of one type of activated nucleoside 5'- 
triphosphate precursor of known nitrogenous base composition; and 

(e) data processing means for determining the base sequence of the unpaired 
nucleotide residues of the template simultaneously in each reaction chamber from the 
sequence of incorporation of said nucleoside precursors. 

58. An apparatus for processing a plurality of analytes, the apparatus 
comprising: 

(a) a flow chamber having disposed therein a substrate comprising at least 
50,000 cavitated surfaces on a fiber optic bundle, each cavitated surface forming a 
reaction chamber adapted to contain analytes, and wherein the reaction chambers have a 
center to center spacing of between 20 to 100 jim and a diameter of 20 to 70 \xm; 

(b) fluid means for delivering processing reagents from one or more reservoirs 
to the flow chamber so that the analytes disposed in the reaction chambers are exposed to 
the reagents; and 

(c) detection means for simultaneously detecting a sequence of optical signals 
from each of the reaction chambers, each optical signal of the sequence being indicative 
of an interaction between a processing reagent and the analyte disposed in the reaction 
chamber, wherein the detection means is in communication with the cavitated surfaces. 

59. The apparatus of claim 58, wherein the detection means is a CCD camera. 

60. The apparatus of claim 58, wherein the analyte is nucleic acid. 

61 . The apparatus of claim 58, wherein the analytes are immobilized on one or 
more mobile solid supports that are disposed in the reaction chamber. 
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62. The apparatus of claim 58 wherein the processing reagents are 
immobilized on one or more mobile solid supports. 

63. A method for sequencing a nucleic acid, the method comprising: 

(a) providing a plurality of single-stranded nucleic acid templates in an array 
having at least 50,000 discrete reaction sites; 

(b) contacting the nucleic acid templates with reagents necessary to perform a 
pyrophosphale-based sequencing reaction coupled to light emission; 

(c) detecting the light emitted from a plurality of reaction sites on respective 
portions of an optically sensitive device; 

(d) converting the light impinging upon each of said portions of said optically 
sensitive device into an electrical signal which is distinguishable from the signals from all 
of said other reaction sites; 

(e) determining the sequence of the nucleic acid templates based on light 
emission for each of said discrete reaction sites from the corresponding electrical signal; 
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Abstract 



An apparatus and method for performing rapid DNA sequencing, such as genomic 
sequencing, is provided herein. The method includes the steps of preparing a sample 
DNA for genomic sequencing, amplifying the prepared DNA in a representative manner, 
and performing multiple sequencing reaction on the amplified DNA with only one primer 
hybridization step. 
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METHODS OF AMPLIFYING AND SEQUENCING NUCLEIC ACIDS 
RELATED APPLICATIONS 

5 This application claims Hie benefit of priority to the following applications: 

USSN: 60/476,602, filed June 6, 2003, entitled ' Method For Preparing Single-Stranded 
DNA Libraries (21465-507 PRO)", USSN: 60/476,504, filed June 6, 2003. entitled "Bead 
Emulsion Nucleic Acid Amplification (21465-508 PRO)", USSN: 60/443.471. filed 
January 29, 2003. entitled "Double Ended Sequencing (21465-509 PROA)", USSN: 

10 60/476,313, filed June 6, 2003, entitled "Double Ended Sequencing (21465-509 PROB)'", 
USSN: 60/476,592, filed June 6, 2003, entitled "Methods Of Amplifying And 
Sequencing Nucleic Acids (21465-510 PRO A)", USSN: 60/465.071, filed April 23, 
2003. entitled "Massively Parallel High Throughput, Low Cost Sequencing (21465-510 
PRO B)". and USSN: 60/497,985, filed August 25, 2003, entitled "A Massively Parallel 

15 Picotiterplate-Based Platform For Discrete Picoliter-Scale Polymerase Chain Reactions 
(21465-510 PRO C)." 

This application incorporates by reference the following copending U.S. patent 
applications: application entitled "Method For Preparing Single-Stranded DNA 
Libraries (21465-507)", filed January 28, 2004, with a serial number to be assigned, 

20 application entitled "Bead Emulsion Nucleic Acid Amplification (21465-508)", filed 
January 28, 2004, with a serial number to be assigned, application entitled "Double 
Ended Sequencing (21465-509 PROA)," filed January 28, 2004, with a serial number to 
be assigned, 

AH patent and patent applications referred to in this application are hereby fully 
25 incorporated by reference. 

FIELD OF THE INVENTION 

This invention relates to a method and apparatus for determining the base 
sequences of DNA. More particularly, this invention relates to methods and an apparatus 
30 with which the base sequences of a genome can be amplified and determined 
automatically or semiautomatically. 
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BACKGROUND OF THE INVENTION 

Development of rapid and sensitive nucleic acid sequencing methods utilizing 
automated DNA sequencers has revolutionized modern molecular biology. Analysis of 
entire genomes of plants, fungi, animals, bacteria, and viruses is now possible with a 
5 concerted effort by a series of machines and a team of technicians. However, the goal of 
rapid and automated or semiautomatic sequencing of a genome in a short time has not 
been possible. There continues to be technical problems for accurate sample preparation, 
amplification and sequencing. 

One technical problem which hinders sequence analysis of genomes has been the 

10 inability of the investigator to rapidly prepare numerous nucleic acid sample 
encompassing a complete genome in a short period of time. 

Another technical problem is the inability to representatively amplified a genome 
to a level that is compatible with the sensitivity of current sequencing methods. Modem 
economically feasible sequencing machines, while sensitive, still require in excess of one 

1 5 million copies of a DNA fragment for sequencing. Current methods for providing high 
copies of DNA sequencing involves variations of cloning or in vitro amplification which 
cannot amplify the number of individual clones (600,000 or more, and tens of millions 
for a human genome) necessary for sequencing a whole genome economically. 

Yet another technical problem in the limitation of current sequencing methods 

20 which can perform, at most, one sequencing reaction per hybridization of oligonucleotide 
primer. The hybridization of sequencing primers is often the rate limiting step 
constricting the throughput of DNA sequencers. 

In most cases, Polymerase Chain Reaction (PCR; Saiki, R.K., et al. y Science 1985, 
230, 1350-1354; Mullis, K„ et ai y Cold Spring Harb. Symp. Quant Biol 1986, 5 1 Pt 1, 

25 263-273) plays an integral part in obtaining DNA sequence information, amplifying 

minute amounts of specific DNA to obtain concentrations sufficient for sequencing. Yet, 
scaling current PCR technology to meet the increasing demands of modern genetics is 
neither cost effective nor efficient, especially when the requirements for full genome 
sequencing are considered. 

30 Efforts to maximize time and cost efficiencies have typically focused on two 

areas: decreasing the reaction volume required for amplifications and increasing the 
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number of simultaneous amplifications performed. Miniaturization confers the advantage 
of lowered sample and reagent utilization, decreased amplification times and increased 
throughput scalability. 

While conventional thermocyclers require relatively long cycling times due to 
5 thermal mass restrictions (Woolley, AT., et al, Anal. Chem. 1 996, 68, 4081-4086). 
smaller reaction volumes can be cycled more rapidly. Continuous flow PCR devices 
have utilized etched microchannels in conjunction with fixed temperature zones to reduce 
reaction volumes to sub-microliter sample levels (Lagally, E.T., et al, Analytical 
Chemistry 2001, 73, 565-570; Schneegas, L, et al, Lab on a Chip - The Royal Society of 
10 Chemistry 2001 , 1, 42-49). 

Glass microcapillaries, heated by air (Kalinina, O., et al, Nucleic Acids Res. 

1997, 25, 1999-2004) or infrared light (Oda, R.P., et al, Anal. Chem. 1998, 70, 4361- 
4368; Huhmer, A.F. and Landers, J.P., Anal. Chem. 2000, 72, 5507-5512), have also 
served as efficient vessels for nanoliter scale reactions. Similar reaction volumes have 

1 5 been attained with microfabricatcd silicon thermocyclers (Bums, M.A., et al, Proc. Natl. 
Acad Sci. USA 1996, 93, 5556-5561). 

In many cases, these miniaturizations have reduced total PCR reaction times to 
less than 30 minutes for modified electric heating elements (Kopp, M.U., et al, Science 

1998, 280, 1046-1048; Chiou, J., Matsudaira, P. ? Sonin, A. and Ehrlich, D.,Anal Chem. 
20 2001, 73, 2018-2021) and hot air cyclers (Kalinina, O., et at., Nucleic Acids Res. 1997, 

25, 1999-2004). and to 240 seconds for some infrared controlled reactions (Giordano, 

B.C., etal,Anal Biochem. 2001, 291, 124-132). 

Certain technologies employ increased throughput and miniaturization 

simultaneously; as in the 1 536-well system design by Sasaki et al. (Sasaki, N., et al, 
25 DNA Res. 1997, 4, 387-391 ). which maintained reaction volumes under 1 jil. As another 

example, Nagai et al. (Nagai, H„ et al, Biosens. Bioelectron. 2001, 16, 1015-1019; 

Nagai, H., et al, Anal Chem. 200 1 , 73, 1043-1047) reported amplification of a single test 

fragment in ten thousand 86 pi reaction pits etched in a single silicon wafer. 

Unfortunately, recovery and utilization of the amplicon from these methods have proven 
30 problematic, requiring evaporation through selectively permeable membranes. 
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Despite these remarkable improvements in reactions volumes and cycle times, 
none of the previous strategies have provided the massively parallel amplification 
required to dramatically increase throughput to levels required for analysis of the entire 
human genome. DNA sequencers continue to be slower and more expensive than would 
5 be desired. In the pure research setting it is perhaps acceptable if a sequencer is slow and 
expensive. But when it is desired to use DNA sequencers in a clinical diagnostic setting 
such inefficient sequencing methods are prohibitive even for a well financed 
institution.The large-scale parallel sequencing of thousands of clonally amplified targets 
would greatly facilitate large-scale, whole genome library analysis without the time 

10 consuming sample preparation process and expensive, error-prone cloning processes. 
Successful high capacity, solid-phase, clonal DNA amplification can be used for 
numerous applications. Accordingly, it is clear that there exists a need for preparation of 
a genome or large template nucleic acids for sequencing, for amplification of the nucleic 
acid template, and for the sequencing of the amplified template nucleic acids without the 

1 5 constraint of one sequencing reaction per hybridization. Furthermore, there is a need for 
a system to connect these various technologies into a viable automatic or semiautomatic 
sequencing machine. 

BRIEF SUMMARY OF THE INVENTION 

20 This invention describes an integrated system, comprising novel methods and 

novel apparatus for (1 ) nucleic acid sample preparation, (2) nucleic acid amplification, 
and (3) DNA sequencing. 

The invention provides a novel method for preparing a library of multiple DNA 
sequences, particularly derived from large template DNA or whole (or partial) genome 

25 DNA. Sequences of single stranded DNA are prepared from a sample of large template 
DNA or whole or partial DNA genomes through fragmentation, polishing, adaptor 
ligation, nick repair, and isolation of single stranded DNA. The method provides for 
generating a ssDNA library linked to solid supports comprising: (a) generating a library 
of ssDNA templates; (b) attaching the ssDNA templates to solid supports; and (c) 

30 isolating the solid supports on which one ssDNA template is attached. 
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The invention also provides for a method of amplifying each individual member 
of a DNA library in a single reaction tube, by, e.g., encapsulating a plurality of DNA 
samples individually in a microcapsule of an emulsion, performing amplification of the 
plurality of encapsulated nucleic acid samples simultaneously, and releasing said 
5 amplified plurality of DNA from the microcapsules for subsequent reactions. In one 
embodiment, single copies of the nucleic acid template species are hybridized to DNA 
capture beads, suspended in complete amplification solution and emulsified into 
microreactors (typically 100 to 200 microns in diameter), after which amplification (e.g., 
PCR) is used to clonally increase copy number of the initial template species to more 

10 than 1 .000,000 copies of a single nucleic acid sequence, preferably between 2 and 20 
million copies of a single nucleic acid. The amplification reaction, for example, may be 
performed simultaneously with at least 3,000 microreactors per microliter of reaction 
mix, and may be performed with over 300,000 microreactors in a single lOOpJ volume 
test tube (e.g., a PCR reaction tube). The present invention also provides for a method of 

15 enriching for those beads that contains a successful DNA amplification event (i.e., by 
removing beads that have no DNA attached thereto). 

The invention also provides for a method of sequencing a nucleic acid from 
multiple primers with a single primer hybridization step. Two or more sequencing 
primers are hybridized to the template DNA to be sequenced. All the sequencing primers 

20 are then protected except for one. Sequencing (e.g., pyrophosphate sequencing) is 

performed again by elongating the unprotected primer. The elongation is either allowed 
to go to completion (with additional polymerase and dNTPs if necessary) or is terminated 
(by polymerase and ddNTPs). Chain completion and/or termination reagents are 
removed. Then one of the protected primers is unprotected and sequencing is performed 

25 by elongating the newly unprotected primer. This process is continued until all the 
sequencing primers are deprotected and sequenced, hi a preferred embodiment, two 
primers (one protected and one unprotected) are used to sequence both ends of a double 
stranded nucleic acid. 

The invention also provides an apparatus and methods for sequencing nucleic 

30 acids using a pyrophosphate based sequencing approach. The apparatus has a charge 
coupled device (CCD) camera, micro fluidics chamber, sample cartridge holder, pump 
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and flow valves. The apparatus uses chemi luminescence as the detection method, which 
for pyrophosphate sequencing has an inherently low background. In a preferred 
embodiment, the sample cartridge for sequencing is termed the TicoTiter plate/ and is 
formed from a commercial fiber optics faceplate, acid-etched to yield hundreds of 

5 thousands of very small wells, each well volume of 75 pL. The apparatus includes a 
novel reagent delivery cuvette adapted for use with the arrays described herein, to 
provide fluid reagents to the picotiter plate, and a reagent delivery means in 
communication with the reagent delivery cuvette. Photons from each well on the 
picotiter plate are channeled into specific pixels on the CCD camera to detect sequencing 

10 reactions. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a schematic representation of the entire process of library 
preparation including the steps of template DNA fragmentation (Figure 

15 1A), end polishing (Figure IB), adaptor ligation (Figure 1C), nick repair, 

strand extension and gel isolation (Figure ID). Figure IE depicts a 
schematic representation of the stages for amplification and sequencing of 
template DNA (Figure IE). Figure IF depicts a representative agarose gel 
containing a sample preparation of a 180-350 base pair adenovirus DNA 

20 library according to the methods of this invention. Figure 1G depicts a 

detailed schematic representation of library preparation, amplification, and 
sequencing. 

Figure 2A depicts a schematic representation of the universal adaptor design 
according to the present invention. Each universal adaptor is generated 

25 from two complementary ssDNA oligonucleotides that are designed to 

contain a 20 bp nucleotide sequence for PCR priming, a 20 bp nucleotide 
sequence for sequence priming and a unique 4 bp discriminating sequence 
comprised of a non-repeating nucleotide sequence {i.e., ACGT, CAGT, 
etc.). Figure 2B depicts a representative universal adaptor sequence pair 

30 for use with the invention. Adaptor A sense strand: SEQ ID NO:l; 

Adaptor A antisense strand: SEQ ID NO:2; Adaptor B sense strand: SEQ 
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ID NO:3: Adaptor B antisense strand: SEQ ED NO:4. Figure 2C depicts a 
schematic representation of universal adaptor design for use with the 
invention. 

Figure 3 depicts the strand displacement and extension of nicked double-stranded 

5 DNA fragments according to the present invention. Following the ligation 

of universal adaptors generated from synthetic oligonucleotides, double- 
stranded DNA fragments will be generated that contain two nicked regions 
following T4 DNA ligase treatment (Figure 3 A). The addition of a strand 
displacing enzyme (i.e., Bst DNA polymerase I) will bind nicks (Figure 

10 3B), strand displace the nicked strand and complete nucleotide extension 

of the strand (Figure 3C) to produce non-nicked double-stranded DNA 
fragments (Figure 3D). 
Figure 4 depicts the isolation of directionally-ligated single-stranded DNA 
according to the present invention using streptavidin-coated beads. 

15 Following ligation with universal adaptors A and B (the two different 

adaptors are sometimes referred to as a "first" and "second" universal 
adaptor), double-stranded DNA will contain adaptors in four possible 
combinations: AA, BB, AB and BA. When universal adaptor B contains 
a S'-biolin, magnetic streptavidin-coated solid supports are used to capture 

20 and isolate the AB, BA and BB populations (population AA is washed 

away). The BB population is retained on the beads as each end of the 
double-stranded DNA is attached to a bead and is not released. However, 
upon washing in the presence of a low salt buffer, only populations AB 
and BA will release a single-stranded DNA fragment that is 

25 complementary to the bound strand. Single-stranded DNA fragments are 

isolated from the supernatant and used as template for subsequent 
amplification and sequencing. This method is described below in more 
detail. 

Figure 5 depicts a schematic of the structure of a DNA capture bead. 
30 Figure 6 depicts a schematic of one embodiment of a bead emulsion amplification 
process. 
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Figure 7 depicts a schematic of an enrichment process to remove heads thai do not 

have any DNA attached thereto. 
Figures 8A-B depict a schematic representation of the double ended sequencing reaction 

according to the present invention. 
5 Figure 9 depicts a double-ended sequencing demonstration on a pyrosequencing 

apparatus according to the invention. 
Figures 10A-F depict an exemplary double ended sequencing process. 
Figures 1 1 A-D depict schematic illustrations of rolling circle-based amplification using 

an anchor primer. 

10 Figure 12 depicts a drawing of a sequencing apparatus according to the present 
invention. 

Figure 13 depicts a drawing of a reagent delivery / perfusion chamber according to 
the present invention. 

Figure 14 depicts a micrograph of a cavitated fiber optic bundle, termed a PicoTiter 
1 5 Plate™, of the invention. 

Figure 15 depicts a micrograph of a picotiter plate carpeted with beads having DNA 
template immobilized thereon and sulfurylase and luciferase immobilized 
thereon. 

Figure 16 depicts a schematic illustration of the reagent flow chamber and FORA 
20 (PicoTiter Plate™). 

Figure 17 depicts a diagram of the analytical instrument of the present invention. 
Figure 18 depicts a schematic illustration of microscopic parallel sequencing 

reactions within a PicoTiter Plate™. 
Figure 19 depicts a micrograph of single well reactions. 
25 Figure 20 depicts a PicoTiterPlate™ loading cartridge. "A" refers to a 
PicoTiterPlate™ with microwells facing into the cartridge, the distance 
between the open sides of the PicoTiterPlate™ wells and the wall of the 
loading cartridge is 0.3 mm; "B" refers to a silicon sealing gasket; "C" 
refers to an inlet port; "D" refers to an inlet loading tube; U E" refers to an 
30 outlet port and "F" refers to an outlet tube. The PicoTiterPlate™ is held in 

the cartridge with plastic clamps. The liquid is filled via the inlet loading 
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tube D and enters the space between the open sides of the PicoTiterPlate™ 
wells and the wall of the laoding cartridge through the inlet port C. The 
area defined by the silicon sealing gasket B is filled and excess liquid 
leaves the cartridge via the outlet port E and the outlet tubing F. 

5 Figure 21 depicts a PicoTiterPlate™ amplification chamber in exploded view. "A" 
refers to an amplification chamber lid with six retaining bolts; "B" refers 
to a closed cell foam insulation pad; T refers to a 25 mm by 75 mm 
standard glass microscope slide; "D" refers to a 0.25 mm thick silicon 
sheet; "E" refers to a PicoTiterPlate™; "F" refers to an amplification 

1 0 chamber base; "G" refers to a second 0.25 mm thick silicon sheet. 

Figure 22 depicts a schematic diagram of solid phase PicoTiterPlate™ PCR. The 
cylindrical structures symbolize individual PicoTiterPlate™ wells. Gray 
spheres symbolize beads with immobilized primers. Forward "F" (red) 
and Reverse "R" (blue) primers are shown in 5' to 3' orientation as 

15 indicated by arrows. Synthesized sequence complementary to the Forward 

and Reverse primers are shown as dark red (F complement) and dark blue 
(R complement) bars. Single stranded template DNA is shown as solid 
gray line, newly synthesized DNA strands as dashed gray lines. 
Fluorescently labeled hybridization probes are shown as green bars. 

20 Figures 23A-C depict fluorescent probes hybridization to bead-immobilized test DNA 

fragments. Figure 23A (upper left) and 23B (upper right) illustrate the 
specificity of a mixed population of probes hybridized to fragment A and 
fragment B immobilized on control beads, respectively. Fragment B 
beads exhibited the Alexa Fluor 647 signal (red), and the fragment A 

25 beads exhibited the Alexa Fluor 488 signal (green). Figure 23C (bottom 

panel) depicts probe fluorescence from DNA capture beads after PTPCR. 
Beads displayed homogenous fragment A and fragment B signals, as well 
as mixes of templates, shown as varying degrees of yellow. 
Figure 24 depicts representative BioAnalyzer output from analysis of a single 

30 stranded DNA library. 

Figure 25 depicts an insert flanked by PCR primers and sequencing primers. 
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Figure 26 depicts truncated product produced by PCR primer mismatch at cross- 
hybridization region (CHR). 

Figure 27 depicts the calculation for primer candidates based on melting 
temperature. 

Figures 28A-D depict the assembly for the nebulizer used for the methods of the 

invention. A tube cap was placed over the top of the nebulizer (Figure 
7A) and the cap was secured with a nebulizer clamp assembly (Figure 
7B). The bottom of the nebulizer was attached to the nitrogen supply 
(Figure 7C) and the entire device was wrapped in parafilm (Figure 7D). 

Figure 29A depicts representative results for LabChip analysis of a single stranded 
DNA library following nebulization and polishing. 

Figure 29B depicts representative size distribution results for an adaptor-ligated single 
stranded DNA library following nebulization, polishing, and gel 
purification. 

Figure 30 depiction of jig used to hold tubes on the stir plate below vertical syringe 
pump. The jig was modified to hold three sets of bead emulsion 
amplification reaction mixtures. The syringe was loaded with the PCR 
reaction mixture and beads. 

Figure 31 depiction of optimal placement of syringes in vertical syringe pump and 
orientation of emulsion tubes below syringe outlets. 

Figure 32 depiction of optimal placement of syringe pump pusher block against 
syringe plungers, and optimal orientation of jig on the stir plate. Using 
this arrangement, the syringe contents were expelled into the agitated 
emulsion oil. 

Figure 33 depiction of beads (see arrows) suspended in individual microreactors 

according to the methods of the invention. 
Figure 34 depiction of double ended sequencing results showing that the sequence of 

both ends of a DNA template are determined. SEQ ID NO:44: 

atgcacatggttgacacagtggt; SEQ ID NO:45: atgcacatggttgacacagtgg; SEQ ID 

NO:46: atgccaccgacctagtctcaaactt. 
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Figure 35 illustrates the encapsulation of a bead comprising two oligonucleotide 

sequences for double stranded sequencing. 
Figure 36 illustrates solution phase PCR and drive to bead procedure - a step in a 

preferred embodiment of double ended sequencing. 
5 Figure 37 illustrates emulsion breaking and recovery of amplified template DNA on 

a bead - a step in a preferred embodiment of double ended sequencing. 
Figure 38 depicts a schematic representation of a preferred method of double 

stranded sequencing. 

Figure 39 illustrates the results of sequencing a Staphylococcus aureus genome. 
10 Figure 40 illustrates the average read lengths in one experiment involving double 
ended sequencing. 

Figure 41 illustrates the number of wells for each genome span in a double ended 

sequencing experiment. 
Figure 42 illustrates a typical output and alignment string from a double ended 
15 sequencing procedure. Sequences shown in order, from top to bottom: 

SEQ ID NO:47-SEQ ID NO:60. 
DETAILED DESCRIPTION OF THE INVENTION 
A novel platform is described herein which permits simultaneous amplification of 
three hundred thousand discrete PCR reactions (PTPCR) in volumes as low as 39.5 
20 picoliters. The pooled PTPCR products from the entire reaction can be recovered 

through a wash step and assayed via real-time PCR for the presence and abundance of 
specific templates. Of greater interest, it is shown herein that these PTPCR products can 
be driven to solid supports and detected by hybridization with two color fluorescent 
probes, allowing high capacity, solid-phase, clonal DNA amplification and large-scale 
25 parallel sequencing. 

The present invention is directed to a method and apparatus for performing 
genomic sequencing which satisfies the objectives of (1 ) preparing a nucleic acid (e.g., a 
genome) in a rapid and efficient manner for sequencing, (2) amplifying the nucleic acid 
in a representative manner, and (3) performing multiple sequencing reactions with only 
30 one primer hybridization. The present invention is particularly suited for genotyping, 
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detection and diagnosis from a small sample of nucleic acid in a cost efficient manner. 

Each of these objectives are listed below. 

Definitions: 

Unless otherwise defined, all technical and scientific terms used herein have the 
5 same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Methods and materials similar or equivalent to those described herein 
can be used in the practice of the present invention, and exemplified suitable methods and 
materials are described below. For example, methods may be described which comprise 
more than two steps. In such methods, not all steps may be required to achieve a defined 

10 goal and the invention envisions the use of isolated steps to achieve these discrete goals. 
The disclosures of all publications, patent applications, patents and other references are 
incorporated in toto herein by reference. In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

As used herein, the term "universal adaptor" refers to two complementary and 

1 5 annealed oligonucleotides that are designed to contain a nucleotide sequence for PCR 
priming and a nucleotide sequence for sequence priming. Optionally, the universal 
adaptor may further include a unique discriminating key sequence comprised of a non- 
repeating nucleotide sequence (i.e., ACGT, CAGT, etc.). A set of universal adaptors 
comprises two unique and distinct double-stranded sequences that can be ligated to the 

20 ends of double-stranded DNA. Therefore, the same universal adaptor or different 

universal adaptors can be ligated to either end of the DNA molecule. When comprised in 
a larger DNA molecule that is single stranded or when present as an oligonucleotide, the 
universal adaptor may be referred to as a single stranded universal adaptor. 

"Target DNA" shall mean a DNA whose sequence is to be determined by the 

25 methods and apparatus of the invention. 

Binding pair shall mean a pair of molecules that interact by means of specific 
non-covalent interactions that depend on the three-dimensional structures of the 
molecules involved. Typical pairs of specific binding partners include antigen-antibody, 
hapten-antibody, hormone-receptor, nucleic acid strand-complementary nucleic acid 

30 strand, substrate-enzyme, substrate analog-enzyme, inhibitor-enzyme, carbohydrate- 
lectin, biotin-avidin, and virus-cellular receptor. 
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As used herein, the term "discriminating key sequence' 1 refers to a sequence 
consisting of at least one of each of the four deoxyribonucleotides (/.<?., A, C, G T T). The 
same discriminating sequence can be used for an entire library of DNA fragments. 
Alternatively, different discriminating key sequences can be used to track libraries of 
5 DNA fragments derived from different organisms. 

As used herein, the term "plurality of molecules" refers to DNA isolated from the 
same source, whereby different organisms may be prepared separately by the same 
method. In one embodiment, the plurality of DNA samples is derived from large 
segments of DNA, whole genome DNA, cDNA, viral DNA or from reverse transcripts of 
10 viral RNA. This DNA may be derived from any source, including mammal (i.e.. human, 
nonhuman primate, rodent or canine), plant, bird, reptile, fish, fungus, bacteria or virus. 

As used herein, the term "library" refers to a subset of smaller sized DNA species 
generated from a single DNA template, either segmented or whole genome. 

As used herein, the term "unique", as in "unique PCR priming regions" refers to a 
15 sequence that does not exist or exists at an extremely low copy level within the DNA 
molecules lo be amplified or sequenced. 

As used herein, the term "compatible" refers to an end of double stranded DNA to 
which an adaptor molecule may be attached (i.e., blunt end or cohesive end). 

As used herein, the term "fragmenting" refers to a process by which a larger 
20 molecule of DNA is converted into smaller pieces of DNA. 

As used herein, "large template DNA" would be DNA of more than 25kb, 
preferably more than 500kb, more preferably more than 1MB, and most preferably 5MB 
or larger. 

As used herein, the term "stringent hybridization conditions" refers to those 
25 conditions under which only complimentary sequences will hybridize to each other. 

The invention described here is generally a system and methods for processing 
nucleic acids. The system and methods can be used to process nucleic acids in a 
multitude of ways that utilize sequencing of nucleic acids. Such sequencing can be 
30 performed to determine the identity of a sequence of nucleic acids, or for single 

nucleotide polymorphism detection in nucleic acid fragments, for nucleic acid expression 
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profiling (comparing the nucleic acid expression profile between two or more states - 
e.g., comparing between diseased and normal tissue or comparing between untreated 
tissue and tissue treated with drug, enzymes, radiation or chemical treatment), for 
haplotying (comparing genes or variations in genes on each of the two alleles present in a 
5 human subject), for karyotyping (diagnostically comparing one or more genes in a test 
tissue - typically from an embryo/fetus prior to conception to detect birth defects with 
the same genes from "normal'* karyotyped subjects), and for genotyping (comparing one 
or more genes in a first individual of a species with the same genes in other individuals of 
the same species). 

10 The system has a number of components. These include (1) the nucleic acid 

template that is to be processed, (2) a picotiter plate for containing the nucleic acid 
template, (3) a flow chamber and fluid delivery means that permits flow of nucleic acid 
processing reagents over the nucleic acid template where the processing reagents 
generate light as the nucleic acid is processed, (4) a light capture means that detects light 

15 emitted as the nucleic acid is processed and that converts the captured light into data, and 
(5) data processing means that processes the data to yield meaningful information about 
the nucleic acid that has been processed. Each of these components of the system will be 
discussed in detail below. 

20 1. NUCLEIC ACID TEMPLATE AND PREPARATION THEREOF 

Nucleic Acid Templates 

The nucleic acid templates that can be sequenced according to the invention, e.g., 
a nucleic acid library, in general can include open circular or closed circular nucleic acid 
molecules. A "closed circle" is a covalently closed circular nucleic acid molecule, e.g., a 
25 circular DNA or RNA molecule. An "open circle" is a linear single-stranded nucleic acid 
molecule having a 5' phosphate group and a 3 1 hydroxyl group. 

In one embodiment, the single stranded nucleic acid contains at least 100 copies 
of a specific nucleic acid sequence, each copy covalently linked end to end. In some 
embodiments, the open circle is formed in situ from a linear double-stranded nucleic acid 
30 molecule. The ends of a given open circle nucleic acid molecule can be ligated by DNA 
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ligase. Sequences at the 5" and 3' ends of the open circle molecule are complementary to 
two regions of adjacent nucleotides in a second nucleic acid molecule, e.g., an adapter 
region of an anchor primer (sometimes referred to as an adapter), or to two regions that 
are nearly adjoining in a second DNA molecule. Thus, the ends of the open-circle 
5 molecule can be ligated using DNA ligase, or extended by DNA polymerase in a gap- 
filling reaction. Open circles are described in detail in Lizardi, U.S. Pat. No. 5,854,033, 
fully incorporated herein by reference. An open circle can be converted to a closed circle 
in the presence of a DNA ligase (for DNA) or RNA ligase following, e.g., annealing of 
the open circle to an anchor primer. 

10 If desired, nucleic acid templates can be provided as padlock probes. Padlock 

probes are linear oligonucleotides that include target-complementary sequences located at 
each end, and which are separated by a linker sequence. The linkers can be ligated to 
ends of members of a library of nucleic acid sequences that have been, e.g., physically 
sheared or digested with restriction endonucleases. Upon hybridization to a target- 

15 sequence, the 5'- and S'-terminal regions of these linear oligonucleotides are brought in 
juxtaposition. This juxtaposition allows the two probe segments (if properly hybridized) 
to be covalently-bound by enzymatic ligation (e.g., with T4 DNA ligase), thus converting 
the probes to circularly-closed molecules which are catenated to the specific target 
sequences (see e.g., Nilsson, et aL, 1994. Science 265: 2085-2088). The resulting probes 

20 are suitable for the simultaneous analysis of many gene sequences both due to their 

specificity and selectivity for gene sequence variants (see e.g., Lizardi, et aL, 1998. Nat. 
Genet. 19: 225-232; Nilsson, etal., 1997. Nat. Genet. 16: 252-255) and due to the fact 
that the resulting reaction products remain localized to the specific target sequences. 
Moreover, intramolecular ligation of many different probes is expected to be less 

25 susceptible to non-specific cross-reactivity than multiplex PCR-based methodologies 

where non-cognate pairs of primers can give rise to irrelevant amplification products (see 
e.g, Landegren and Nilsson, 1997. Ann. Med. 29: 585-590). 

A starting nucleic acid template library can be constructed comprising either 
single-stranded or double-stranded nucleic acid molecules, provided that the nucleic acid 

30 sequence includes a region that, if present in the library, is available for annealing, or can 
be made available for annealing, to an anchor primer sequence. For example, when used 
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as a template for rolling circle amplification, a region of a double-stranded template 
needs to be at least transiently single-stranded in order to act as a template for extension 
of the anchor primer. 

Library templates can include multiple elements, including, but not limited to, one 
5 or more regions that are complementary to the anchor primer. For example, the template 
libraries may include a region complementary to a sequencing primer, a control 
nucleotide region, and an insert sequence comprised of the sequencing template to be 
subsequently characterized. As is explained in more detail below, the control nucleotide 
region is used to calibrate the relationship between the amount of byproduct and the 
10 number of nucleotides incorporated. As utilized herein the term '•complement' 1 refers to 
nucleotide sequences that are able to hybridize to a specific nucleotide sequence to form a 
matched duplex. 

In one embodiment, a library template includes: (/) two distinct regions that are 
complementary to the anchor primer, (ii) one region homologous to the sequencing 

15 primer, (Hi) one optional control nucleotide region, (iv) an insert sequence of, e.g., 30- 
500, 50-200, or 60-100 nucleotides, that is to be sequenced. The template can, of course, 
include two, three, or all four of these features. 

The template nucleic acid can be constructed from any source of nucleic acid, 
e.g., any cell, tissue, or organism, and can be generated by any art-recognized method. 

20 Suitable methods include, e.g., sonication of genomic DNA and digestion with one or 
more restriction endonucleases (RE) to generate fragments of a desired range of lengths 
from an initial population of nucleic acid molecules. Preferably, one or more of the 
restriction enzymes have distinct four-base recognition sequences. Examples of such 
enzymes include, e.g., Sau3Al, Mspl, and Taql. Preferably, the enzymes are used in 

25 conjunction with anchor primers having regions containing recognition sequences for the 
corresponding restriction enzymes. In some embodiments, one or both of the adapter 
regions of the anchor primers contain additional sequences adjoining known restriction 
enzyme recognition sequences, thereby allowing for capture or annealing to the anchor 
primer of specific restriction fragments of interest to the anchor primer. In other 

30 embodiments, the restriction enzyme is used with a type IIS restriction enzyme. 
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Alternatively, template libraries can be made by generating a complementary 
DNA (cDNA) library from RNA, e.g., messenger RNA (mRNA). The cDNA library can, 
if desired, be further processed with restriction endonucleases to obtain a 3' end 
characteristic of a specific RNA, internal fragments, or fragments including the 3' end of 
5 the isolated RNA. Adapter regions in the anchor primer may be complementary to a 
sequence of interest that is thought to occur in the template library, e.g., a known or 
suspected sequence polymorphism within a fragment generated by endonuclease 
digestion. 

In one embodiment, an indexing oligonucleotide can be attached to members of a 
10 template library to allow for subsequent correlation of a template nucleic acid with a 
population of nucleic acids from which the template nucleic acid is derived. For 
example, one or more samples of a starting DNA population can be fragmented 
separately using any of the previously disclosed methods (e.g., restriction digestion, 
sonication). An indexing oligonucleotide sequence specific for each sample is attached 
15 to, e.g., ligated to, the termini of members of the fragmented population. The indexing 
oligonucleotide can act as a region for circularization, amplification and, optionally, 
sequencing, which permits it to be used to index, or code, a nucleic acid so as to identify 
the starting sample from which it is derived. 

Distinct template libraries made with a plurality of distinguishable indexing 
20 primers can be mixed together for subsequent reactions. Determining the sequence of the 
member of the library allows for the identification of a sequence corresponding to the 
indexing oligonucleotide. Based on this information, the origin of any given fragment 
can be inferred. 

The invention includes a sample preparation process that results in a solid or a 
25 mobile solid substrate array containing a plurality of anchor primers or adapaters 
covalently linked to template nucleic acids. 

When the template nucleic acid is circular, formation of the covalently linked 
anchor primer and one or more copies of the target nucleic acid preferably occurs by 
annealing the anchor primer to a complementary region of a circular nucleic acid, and 
30 then extending the annealed anchor primer with a polymerase to result in formation of a 



17 



Express Mail: EV 399526138 US 21465-510 UT1L 

Mailed January 28, 2004 

nucleic acid containing one or more copies of a sequence complementary lo the circular 
nucleic acid. 

Attachment of the anchor primer to a solid or mobile solid substrate can occur 
before, during, or subsequent to extension of the annealed anchor primer. Thus, in one 
5 embodiment, one or more anchor primers are linked to the solid or a mobile solid 

substrate, after which the anchor primer is annealed to a target nucleic acid and extended 
in the presence of a polymerase. Alternatively, in a second embodiment, an anchor 
primer is first annealed to a target nucleic acid, and a 3 'OH terminus of the annealed 
anchor primer is extended with a polymerase. The extended anchor primer is then linked 
10 to the solid or mobile solid substrate. By varying the sequence of anchor primers, it is 
possible to specifically amplify distinct target nucleic acids present in a population of 
nucleic acids. 

Outlined below is a preferred embodiment for the preparation of template nucleic 
acids for amplification and sequencing reactions. The invention includes a method for 

15 preparing the sample DNA comprised of seven general steps: (a) fragmenting large 

template DNA or whole genomic DNA samples to generate a plurality of digested DNA 
fragments; (b) creating compatible ends on the plurality of digested DNA samples; (c) 
ligating a set of universal adaptor sequences onto the ends of fragmented DNA molecules 
to make a plurality of adaptor-ligated DNA molecules, wherein each universal adaptor 

20 sequence has a known and unique base sequence comprising a common PCR primer 
sequence, a common sequencing primer sequence and a discriminating four base key 
sequence and wherein one adaptor is attached to biotin; (d) separating and isolating the 
plurality of ligated DNA fragments; (e) removing any portion of the plurality of ligated 
DNA fragments; (0 nick repair and strand extension of the plurality of ligated DNA 

25 fragments; (g) attaching each of the ligated DNA fragments to a solid support; and (h) 
isolating populations comprising single-stranded adaptor-ligated DNA fragments for 
which there is a unique adaptor at each end (i.e., providing directionality). 

The following discussion summarizes the basic steps involved in the methods of 
the invention. The steps are recited in a specific order, however, as would be known by 

30 one of skill in the art, the order of the steps may be manipulated to achieve the same 
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result. Such manipulations are contemplated by the inventors. Further, some steps may 
be minimized as would also be known by one of skill in the art. 



Fragmentation 

5 In the practice of the methods of the present invention, the fragmentation of the 

DNA sample can be done by any means known to those of ordinary skill in the art. 
Preferably, the fragmenting is performed by enzymatic or mechanical means. The 
mechanical means may be sonication or pnysical shearing. The enzymatic means may be 
performed by digestion with nucleases (e.g., Deoxyribonuclease 1 (DNase I)) or one or 

10 more restriction endonucleases. In a preferred embodiment, the fragmentation results in 
ends for which the sequence is not known. 

In a preferred embodiment, the enzymatic means is DNase I. DNase I is a 
versatile enzyme that nonspecifically cleaves double-stranded DNA (dsDNA) to release 
5 f -phosphorylated di-, tri-, and oligonucleotide products. DNase I has optimal activity in 

15 buffers containing Mn 2+ , Mg 2+ and Ca 2+ , but no other salts. The purpose of the DNase I 
digestion step is to fragment a large DNA genome into smaller species comprising a 
library. The cleavage characteristics of DNase I will result in random digestion of 
template DNA (i.e., no sequence bias) and in the predominance of blunt-ended dsDNA 
fragments when used in the presence of manganese-based buffers (Melgar, E. and D.A. 

20 Goldthwait. 1968. Deoxyribonucleic acid nucleases. II. The effects of metal on the 
mechanism of action of deoxyribonuclease I. J. Biol. Cheat. 243: 4409). The range of 
digestion products generated following DNase I treatment of genomic templates is 
dependent on three factors: i) amount of enzyme used (units); ii) temperature of digestion 
(°C); and iii) incubation time (minutes). The DNase I digestion conditions outlined 

25 below have been optimized to yield genomic libraries with a size range from 50-700 base 
pairs (bp). 

In a preferred embodiment, the DNase I digests large template DNA or whole 
genome DNA for 1-2 minutes to generate a population of polynucleotides. In another 
preferred embodiment, the DNase I digestion is performed at a temperature between 
30 10°C-37°C. In yet another preferred embodiment, the digested DNA fragments are 
between 50 bp to 700 bp in length. 
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Polishing 

Digestion of genomic DNA (gDNA) templates with DNase I in the presence of 
Mn 2+ will yield fragments of DNA that are either blunt-ended or have protruding termini 
5 with one or two nucleotides in length. In a preferred embodiment, an increased number 
of blunt ends are created with Pfu DNA polymerase. In other embodiments, blunt ends 
can be created with less efficient DNA polymerases such as T4 DNA polymerase or 
Klenow DNA polymerase. Pfu ''polishing" or blunt ending is used to increase the 
amount of blunt-ended species generated following genomic template digestion with 

10 DNase 1. Use of Pfu DNA polymerase for fragment polishing will result in the fill-in of 
5' overhangs. Additionally, Pfu DNA polymerase does not exhibit DNA extendase 
activity but does have 3*-> 5* exonuclease activity that will result in the removal of 
single and double nucleotide extensions to further increase the amount of blunt-ended 
DNA fragments available for adaptor ligation (Costa, G.L. and MP. Weiner. 1 994a. 

1 5 Protocols for cloning and analysis of blunt-ended PCR-generated DNA fragments. PCR 
Methods Appl 3(5):S95; Costa, G.L., A. Grafsky and MP. Weiner. 1994b. Cloning and 
analysis of PCR-generated DNA fragments. PCR Methods Appl 3(6):338; Costa, G.L. 
and M.P. Weiner. 1994c. Polishing with T4 or Pfii polymerase increases the efficiency 
of cloning of PCR products. Nucleic Acids Res, 22(12):2423). 

20 

Adaptor Ligation 

If the libraries of nucleic acids are to be attached to the solid substrate, then 
preferably the nucleic acid templates are annealed to anchor primer sequences using 
recognized techniques (see, e.g., Hatch, etal. 1999. Genet Anal. BiomoL Engineer. 15: 

25 35-40; KooK U.S. Patent No. 5,714, 320 and Lizardi, U.S. Patent No. 5,854,033). In 
general, any procedure for annealing the anchor primers to the template nucleic acid 
sequences is suitable as long as it results in formation of specific, i.e., perfect or nearly 
perfect, complementarity between the adapter region or regions in the anchor primer 
sequence and a sequence present in the template library. 

30 In a preferred embodiment, following fragmentation and blunt ending of the DNA 

library, universal adaptor sequences are added to each DNA fragment. The universal 

20 
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adaptors are designed to include a set of unique PCR priming regions that are typically 20 
bp in length located adjacent to a set of unique sequencing priming regions that are 
typically 20 bp in length optionally followed by a unique discriminating key sequence 
consisting of at least one of each of the four deoxyribonucleotides (i.e., A, C, G, T). In a 
5 preferred embodiment, the discriminating key sequence is 4 bases in length. In another 
embodiment, the discriminating key sequence may be combinations of 1-4 bases. In yet 
another embodiment, each unique universal adaptor is forty- four bp (44 bp) in length. In 
a preferred embodiment the universal adaptors are ligated, using T4 DNA ligase, onto 
each end of the DNA fragment to generate a total nucleotide addition of 88 bp to each 
10 DNA fragment. Different universal adaptors are designed specifically for each DNA 

library preparation and will therefore provide a unique identifier for each organism. The 
size and sequence of the universal adaptors may be modified as would be apparent to one 
of skill in the art. 

For example, to prepare two distinct universal adaptors (i.e., "first" and 

1 5 "second"), single-stranded oligonucleotides may be ordered from a commercial vendor 
(/.ft, Integrated DNA Technologies, IA or Operon Technologies, CA). In one 
embodiment, the universal adaptor oligonucleotide sequences are modified during 
synthesis with two or three phosphorothioate linkages in place of phosphodiester linkages 
at both the 5' and 3' ends. Unmodified oligonucleotides are subject to rapid degradation 

20 by nucleases and are therefore of limited utility. Nucleases are enzymes that catalyze the 
hydrolytic cleavage of a polynucleotide chain by cleaving the phosphodiester linkage 
between nucleotide bases. Thus, one simple and widely used nuclease-resistant 
chemistry available for use in oligonucleotide applications is the phosphorothioate 
modification. In phosphorothioates, a sulfur atom replaces a non-bridging oxygen in the 

25 oligonucleotide backbone making it resistant to all forms of nuclease digestion (i.e. 
resistant to both endonuclease and exonuclease digestion). Each oligonucleotide is 
HPLC-purified to ensure there are no contaminating or spurious oligonucleotide 
sequences in the synthetic oligonucleotide preparation. The universal adaptors are 
designed to allow directional ligation to the blunt-ended, fragmented DNA. Each set of 

30 double-stranded universal adaptors are designed with a PCR priming region that contains 
noncomplementary 5' four-base overhangs that cannot ligate to the blunt-ended DNA 
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fragment as well as prevent ligation with each other at these ends. Accordingly, binding 
can only occur between the 3 1 end of the adaptor and the 5* end of the DNA fragment or 
between the 3' end of the DNA fragment and the 5' end of the adaptor. Double-stranded 
universal adaptor sequences are generated by using single-stranded oligonucleotides that 

5 are designed with sequences that allow primarily complimentary oligonucleotides to 
anneal, and to prevent cross-hybridization between two non-complimentary 
oligonucleotides. In one embodiment, 95% of the universal adaptors are formed from the 
annealing of complimentary oligonucleotides. In a preferred embodiment, 97% of the 
universal adaptors are formed from the annealing of complimentary oligonucleotides. In 

10 a more preferred embodiment, 99% of the universal adaptors are formed from the 

annealing of complimentary oligonucleotides. In a most preferred embodiment, 100% of 
the universal adaptors are formed from the annealing of complimentary oligonucleotides. 

One of the two adaptors can be linked to a support binding moiety. In a preferred 
embodiment, a 5' biotin is added to the first universal adaptor to allow subsequent 

1 5 isolation of ssDNA template and noncovalent coupling of the universal adaptor to the 
surface of a solid support that is saturated with a biotin-binding protein (i.e. streptavidin, 
neutravidin or avidin). Other linkages are well known in the art and may be used in place 
of biotin-streptavidin (for example antibody/antigen-epitope, receptor/ligand and 
oligonucleotide pairing or complimentarity) one embodiment, the solid support is a bead, 

20 preferably a polystyrene bead. In one preferred embodiment, the bead has a diameter of 
about 2.8 urn. As used herein, this bead is referred to as a "sample prep bead" 

Each universal adaptor may be prepared by combining and annealing two ssDNA 
oligonucleotides, one containing the sense sequence and the second containing the 
antisense (complementary) sequence. Schematic representation of the universal adaptor 

25 design is outlined in Figure 2. 

Isolation of Ligation Products 

The universal adaptor ligation results in the formation of fragmented DNAs with 
adaptors on each end, unbound single adaptors, and adaptor dimers. In a preferred 
30 embodiment, agarose gel electrophoresis is used as a method to separate and isolate the 
adapted DNA library population from the unligated single adaptors and adaptor dimer 
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populations. In other embodiments, the fragments may be separated by size exclusion 
chromatography or sucrose sedimentation. The procedure of DNase 1 digestion of DNA 
typically yields a library population that ranges from 50-700 bp. In a preferred 
embodiment, upon conducting agarose gel electrophoresis in the presence of a DNA 

5 marker, the addition of the 88 bp universal adaptor set will shift the DNA library 
population to a larger size and will result in a migration profile in the size range of 
approximately 130-800 bp; adaptor dimers will migrate at 88 bp; and adaptors not ligated 
will migrate at 44 bp. Therefore, numerous double-stranded DNA libraries in sizes 
ranging from 200-800 bp can be physically isolated from the agarose gel and purified 

10 using standard gel extraction techniques. In one embodiment, gel isolation of the adapted 
ligated DNA library will result in the recovery of a library population ranging in size 
from 200-400 bp. Other methods of distinguishing adaptor-ligated fragments are known 
to one of skill in the art. 

15 Nick Repair 

Because the DNA oligonucleotides used for the universal adaptors are not 5' 
phosphorylated, gaps will be present at the 3* junctions of the fragmented DNAs 
following ligase treatment (see Figure 3 A). These two "gaps" or u nicks" can be filled in 
by using a DNA polymerase enzyme that can bind to, strand displace and extend the 

20 nicked DNA fragments. DNA polymerases that lack 3'-> 5' exonuclease activity but 
exhibit 5 1 -> 3' exonuclease activity have the ability to recognize nicks, displace the 
nicked strands, and extend the strand in a manner that results in the repair of the nicks 
and in the formation of non-nicked double-stranded DNA (see Figure 3B and 3C) 
(Hamilton, S.C., J.W. Farchaus and M.C. Davis. 2001 . DNA polymerases as engines for 

25 biotechnology. BioTeclmiques 31:370). 

Several modifying enzymes are utilized for the nick repair step, including but not 
limited to polymerase, ligase and kinase. DNA polymerases that can be used for this 
application include, for example, E. coli DNA pol I, Thermoanaerobacter 
thermohydrosulfaricus pol I, and bacteriophage phi 29. In a preferred embodiment, the 

30 strand displacing enzyme Bacillus stearothermophilus pol I (Bst DNA polymerase I) is 
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used to repair the nicked dsDNA and results in non-nicked dsDNA (see Figure 3D). In 
another preferred embodiment, the ligase is T4 and the kinase is polynucleotide kinase. 

Isolation of Single-Stranded DNA 

5 Following the generation of non-nicked dsDNA, ssDNAs comprising both the 

first and second adaptor molecules are to be isolated (desired populations are designated 
below with asterisks; "A" and "B" correspond to the first and second adaptors). Double- 
stranded DNA libraries will have adaptors bound in the following configurations: 

1 0 Universal Adaptor A - DNA fragment - Universal Adaptor A 
Universal Adaptor B - DNA fragment - Universal Adaptor A* 
Universal Adaptor A - DNA fragment - Universal Adaptor B* 
Universal Adaptor B - DNA fragment - Universal Adaptor B 

1 5 Universal adaptors are designed such that only one universal adaptor has a 5 ? 

biotin moiety. For example, if universal adaptor B has a 5'biotin moiety, streptavidin- 
coated sample prep beads can be used to bind all double-stranded DNA library species 
with universal adaptor B. Genomic library populations that contain two universal adaptor 
A species will not contain a 5' biotin moiety and will not bind to streptavidin-containing 

20 sample prep beads and thus can be washed away. The only species that will remain 

attached to beads are those with universal adaptors A and B and those with two universal 
adaptor B sequences. DNA species with two universal adaptor B sequences (i.e., biotin 
moieties at each 5 'end) will be bound to streptavidin-coated sample prep beads at each 
end, as each strand comprised in the double strand will be bound. Double-stranded DNA 

25 species with a universal adaptor A and a universal adaptor B will contain a single 5'biotin 
moiely and thus will be bound to streptavidin-coated beads at only one end. The sample 
prep beads are magnetic, therefore, the sample prep beads will remain coupled to a solid 
support when magnetized. Accordingly, in the presence of a low-salt ("melt" or 
denaturing) solution, only those DNA fragments that contain a single universal adaptor A 

30 and a single universal adaptor B sequence will release the complementary unbound 
strand. This single-stranded DNA population may be collected and quantitated by, for 
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example, pyrophosphate sequencing, real-time quantitative PCR, agarose gel 
electrophoresis or capillary gel electrophoresis. 

Attachment of Template to Beads 

5 In one embodiment, ssDNA libraries that are created according to the methods of 

the invention are quantitated to calculate the number of molecules per unit volume. These 
molecules are annealed to a solid support (bead) that contain oligonucleotide capture 
primers that are complementary to the PCR priming regions of the universal adaptor ends 
of the ssDNA species. Beads are then transferred to an amplification protocol. Clonal 

1 0 populations of single species captured on DNA beads may then sequenced. In one 
embodiment, the solid support is a bead, preferably a sepharose bead. As used herein, 
this bead is referred to as a "DNA capture bead". 

The beads used herein may be of any convenient size and fabricated from any 
number of known materials. Example of such materials include: inorganics, natural 

15 polymers, and synthetic polymers. Specific examples of these materials include: 
cellulose, cellulose derivatives, acrylic resins, glass; silica gels, polystyrene, gelatin, 
polyvinyl pyrrolidone, co-polymers of vinyl and acrylamide, polystyrene cross-linked 
with divinylbenzene or the like (see, Merrifield Biochemistry 1964, 3, 1385-1390), 
polyacrylamides, latex gels, polystyrene, dextran, rubber, silicon, plastics, nitrocellulose, 

20 celluloses, natural sponges, silica gels, glass, metals plastic, cellulose, cross-linked 
dextrans (e.g., Sephadex™) and agarose gel (Sepharose™) and solid phase supports 
known to those of skill in the art. In one embodiment, the diameter of the DNA capture 
bead is in the range of 20-70jam. In a preferred embodiment, the diameter of the DNA 
capture bead is in a range of 20-50jim. In a more preferred embodiment, the diameter of 

25 the DNA capture bead is about 30|im. 

In one aspect, the invention includes a method for generating a library of solid 
supports comprising: (a) preparing a population of ssDNA templates according to the 
methods disclosed herein; (b) attaching each DNA template to a solid support such that 
there is one molecule of DNA per solid support; (c) amplifying the population of single- 

30 stranded templates such that the amplification generates a clonal population of each DNA 
fragment on each solid support; (d) sequencing clonal populations of beads. 

25 
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In one embodiment, the solid support is a DNA capture bead. In another 
embodiment, the DNA is genomic DNA, cDNA or reverse transcripts of viral RNA. The 
DNA may be attached to the solid support, for example, via a biotin-streptavidin linkage, 
a covalcnt linkage or by complementary oligonucleotide hybridization. In one 
5 embodiment, each DNA template is ligated to a set of universal adaptors. In another 
embodiment, the universal adaptor pair comprises a common PCR primer sequence, a 
common sequencing primer sequence and a discriminating key sequence. Single- 
stranded DNAs are isolated that afford unique ends; single stranded molecules are then 
attached to a solid support and exposed to amplification techniques for clonal expansion 

10 of populations. The DNA may be amplified by PCR. 

In another aspect, the invention provides a library of solid supports made by the 
methods described herein. 

The nucleic acid template (e.g., DNA template) prepared by this method may be 
used for many molecular biological procedures, such as linear extension, rolling circle 

15 amplification, PCR and sequencing. This method can be accomplished in a linkage 
reaction, for example, by using a high molar ratio of bead to DNA. Capture of single- 
stranded DNA molecules will follow a poisson distribution and will result in a subset of 
beads with no DNA attached and a subset of beads with two molecules of DNA attached. 
In a preferred embodiment, there would be one bead to one molecule of DNA. In 

20 addition, it is possible to include additional components in the adaptors that may be 
useful for additional manipulations of the isolated library. 

2. NUCLEIC ACID TEMPLATE AMPLIFICATION 

In order for the nucleic acid template to be sequenced according to the methods of 
25 this invention the copy number must be amplified to generate a sufficient number of 
copies of the template to produce a detectable signal by the light detection means. Any 
suitable nucleic acid amplification means maybe used. 

A number of//; vitro nucleic acid amplification techniques have been described. 
These amplification methodologies may be differentiated into those methods: (/) which 
30 require temperature cycling - polymerase chain reaction (PCR) (see e.g., Saiki, ei ai, 

1 995. Science 230: 1 350- 1 354), ligase chain reaction (see e.g., Barany, 1991. Proc. Nail. 
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Acad, ScL USA 88: 189-193; Barringer, et ai t 1990. 89: 1 17-122) and 
transcription-based amplification (see e.g.. Kwoh, et ai t 1989. Proc. Natl. Acad. ScL 
USA 86: 1 173-1 1 77) and (/'/) isothermal amplification systems - self-sustaining, sequence 
replication (see e.g., Guatelli, et aL t 1990. Proc. Natl Acad. ScL USA 87: 1874-1878); 
5 the Qp replicase system (see e.g., Lizardi, et ai, 1988. BioTechnology 6: 1 197-1202); 
strand displacement amplification Nucleic Acids Res. 1992 Apr 1 1 ;20(7): 1691-6.; and 
the methods described in PNAS 1992 Jan l;89(l):392-6; and NASBA J Virol Methods. 
1991 Dec;35(3):273-86. 

In one embodiment, isothermal amplification is used. Isothermal amplification 

10 also includes rolling circle-based amplification (RCA). RCA is discussed in, e.g., KooL 
U.S. Patent No. 5,714,320 and Lizardi, U.S. Patent No. 5,854,033; Hatch, et ai, 1999. 
Genet. Anal. BiomoL Engineer 15: 35-40. The result of the RCA is a single DNA strand 
extended from the 3' terminus of the anchor primer (and thus is linked to the solid 
support matrix) and including a concatamer containing multiple copies of the circular 

1 5 template annealed to a primer sequence. Typically, 1 ,000 to 1 0,000 or more copies of 
circular templates, each having a size of, e.g., approximately 30-500, 50-200, or 60-100 
nucleotides size range, can be obtained with RCA. 

The product of RCA amplification following annealing of a circular nucleic acid 
molecule to an anchor primer is shown schematically in Figure 1 1 A. A circular template 

20 nucleic acid 102 is annealed to an anchor primer 104, which has been linked to a surface 
106 at its 5' end and has a free 3 f OH available for extension. The circular template 
nucleic acid 102 includes two adapter regions 108 and 1 10 which are complementary to 
regions of sequence in the anchor primer 104. Also included in the circular template 
nucleic acid 102 is an insert 112 and a region 1 14 homologous to a sequencing primer, 

25 which is used in the sequencing reactions described below. 

Upon annealing, the free 3'-OH on the anchor primer 104 can be extended using 
sequences within the template nucleic acid 102. The anchor primer 102 can be extended 
along the template multiple times, with each iteration adding to the sequence extended 
from the anchor primer a sequence complementary to the circular template nucleic acid. 

30 Four iterations, or four rounds of rolling circle replication, arc shown in Figure 1 1 A as 
the extended anchor primer amplification product 1 14. Extension of the anchor primer 

27 



Express Mail: EV 399526138 US 21465-510 UT1L 

Mailed January 28, 2004 

results in an amplification product covalently or otherwise physically attached to the 
substrate 106. A number of hi vitro nucleic acid amplification techniques may be utilized 
to extend the anchor primer sequence. The amplification is typically performed in the 
presence of a polymerase, e.g., a DNA or RNA-directcd DNA polymerase, and one, two, 
5 three, or four types of nucleotide triphosphates, and, optionally, auxiliary binding 

proteins. In general, any polymerase capable of extending a primed 3 ? -OH group can be 
used a long as it lacks a 3' to 5' exonuclease activity. Suitable polymerases include, e.g., 
the DNA polymerases from Bacillus stearothermophilus y Thermits acquaticus, 
Pyrococcus furiosis, Thermococcus litoralis, and Thermus thermophilic, bacteriophage 

10 T4 and T7, and the E. coli DNA polymerase I Klenow fragment. Suitable RNA-directed 
DNA polymerases include, e.g., the reverse transcriptase from the Avian Myeloblastosis 
Virus, the reverse transcriptase from the Moloney Murine Leukemia Virus, and the 
reverse transcriptase from the Human Immunodeficiency Virus-1. 

Additional embodiments of circular templates and anchor primers are shown in 

15 more detail in Figure 1 1 B- 1 1 D. Figure 1 1 B illustrates an annealed open circle linear 

substrate that can serve, upon ligation, as a template for extension of an anchor primer. A 
template molecule having the sequence 5' - teg tgt gag gtc tea gca let tat gta tat tta ctt 
eta ttc tea gtt gec taa get gca gec a - 3' (SEQ ID NO:5) is annealed to an anchor primer 
having a biotin linker at its 5' terminus and the sequence 5 ? -gac etc aca cga tgg ctg cag 

20 ctt - 3* (SEQ ID NO:6). Annealing of the template results in juxtaposition of the 5" and 
3* ends of the template molecule. The 3'OH of the anchor primer can be extended using 
the circular template. 

The use of a circular template and an anchor primer for identification of single 
nucleotide polymorphisms is shown in Figure 1 1C. Shown is a generic anchor primer 

25 having the sequence 5' - gac etc aca cga tgg ctg cag ctt - 3'(SEQ ID NO:7). The anchor 
primer anneals to an SNP probe having the sequence 5' - ttt ata tgt att eta cga etc tgg agt 
gtg eta ccg acg teg aat ccg ttg act ctt ate ttc a - 3' (SEQ ID NO:8). The SNP probe in 
turn hybridizes to a region of a SNP-containing region of a gene having the sequence 5' 
- eta get cgt aca tat aaa tga aga taa gat cct g - 3' (SEQ ID NO:9). Hybridization of a 

30 nucleic acid sequence containing the polymorphism to the SNP probe complex allows for 
subsequent ligation and circularization of the SNP probe. The SNP probe is designed so 
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that its 5 r and 3' termini anneal to the genomic region so as to abut in the region of the 
polymorphic site, as is indicated in Figure 1 1 C. The circularized SNP probe can be 
subsequently extended and sequenced using the methods described herein. A nucleic 
acid lacking the polymorphism does not hybridize so as to result in juxtaposition of the 5 ? 
5 and 3' termini of the SNP probe. In this case, the SNP probe cannot be ligated to form a 
circular substrate needed for subsequent extension. 

Figure 1 ID illustrates the use of a gap oligonucleotide to along with a circular 
template molecule. An anchor primer having the sequence 5'-gac etc aca cga gta gca 
tgg ctg cag ctt - 3 ? (SEQ ID NO: 10) is attached to a surface through a biotin linker. A 

10 template molecule having the sequence 5' - teg tgt gag gtc tea gca tct tat gta tat tta ctt eta 
ttc tea gtt gec taa get gca gec a - 3' (SEQ ID NO:l 1) is annealed to the anchor primer to 
result in partially single stranded, or gapped region, in the anchor primer flanked by a 
double-stranded region. A gapping molecule having the sequence 5' - tgc tac - 3* then 
anneals to the anchor primer. Ligation of both ends of the gap oligonucleotide to the 

15 template molecule results in formation of a circular nucleic acid molecule that can act as 
a template for rolling circle amplification. 

RCA can occur when the replication of the duplex molecule begins at the origin. 
Subsequently, a nick opens one of the strands, and the free 3Merminal hydroxyl moiety 
generated by the nick is extended by the action of DNA polymerase. The newly 

20 synthesized strand eventually displaces the original parental DNA strand. This 

aforementioned type of replication is known as rolling-circle replication (RCR) because 
the point of replication may be envisaged as "rolling around" the circular template strand 
and, theoretically, it could continue to do so indefinitely. Additionally, because the 
newly synthesized DNA strand is covalently-bound to the original template, the displaced 

25 strand possesses the original genomic sequence (e.g., gene or other sequence of interest) 
at its 5'-tcrminus. In RCR, the original genomic sequence is followed by any number of 
^replication units" complementary to the original template sequence, wherein each 
replication unit is synthesized by continuing revolutions of said original template 
sequence. Hence, each subsequent revolution displaces the DNA which is synthesized in 

30 the previous replication cycle. 
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Through the use of the RCA reaction, a strand may be generated which represents 
many tandem copies of the complement to the circularized molecule. For example, RCA 
has recently been utilized to obtain an isothermal cascade amplification reaction of 
circularized padlock probes in vitro in order to detect single-copy genes in human 

5 genomic DNA samples (see Lizardi, et al. 1998. Nat. Genet. 19: 225-232). In addition, 
RCA has also been utilized to detect single DNA molecules in a solid phase-based assay, 
although difficulties arose when this technique was applied to in situ hybridization (see 
Lizardi, et al., 1998. Nat. Genet. 19: 225-232). 

If desired, RCA can be performed at elevated temperatures, e.g., at temperatures 

10 greater than 37° C, 42° C, 45° C, 50° C, 60° C, or 70° C. In addition, RCA can be 

performed initially at a lower temperature, e.g., room temperature, and then shifted to an 
elevated temperature. Elevated temperature RCA is preferably performed with 
thermostable nucleic acid polymerases and with primers that can anneal stably and with 
specificity at elevated temperatures. 

1 5 RCA can also be performed with non-naturally occurring oligonucleotides, e.g., 

peptide nucleic acids. Further, RCA can be performed in the presence of auxiliary 
proteins such as single-stranded binding proteins. 

The development of a method of amplifying short DNA molecules which have 
been immobilized to a solid support, termed RCA has been recently described in the 

20 literature (see e.g., Hatch, et al, 1999. Genet. Anal. Biomol. Engineer. 15: 35-40; Zhang, 
etaL, 1998. Gene 211: 277-85; Baner, et ai, 1998. Nucl. Acids Res. 26: 5073-5078; Liu, 
etaL, 1995. J. Am. Chenu Soc. 118: 1587-1594; Fire and Xu, 1995. Proc. Natl. Acad. 
Sci. USA 92: 4641-4645; Nilsson, et ai, 1994. Science 265: 2085-2088). RCA targets 
specific DNA sequences through hybridization and a DNA ligase reaction. The circular 

25 product is then subsequently used as a template in a rolling circle replication reaction. 
Other examples of isothermal amplification systems include, e.g., (/) self- 
sustaining, sequence replication (see e.g., Guatelli, et al., 1990. Proc. Natl. Acad. Sci. 
USA 87: 1874-1878), (//) the Qp replicase system (see e.g., Lizardi, et al. t 1988. 
BioTechnology 6: 1 1 97- 1 202), and (///) nucleic acid sequence-based amplification 

30 (NASBA™; see Kievits, et al, 1991. J. Virol Methods 35: 273-286). 
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PCR Amplification of Nucleic Acid Templates 

In a preferred embodiment, polymerase chain reaction ("PCR") is used to 
generate additional copies of the template nucleic acids. The PCR amplification step may 
be performed prior to distribution of the nucleic acid templates onto the picotiter plate or 
5 may be performed after the nucleic acid templates have been distributed onto the picotiter 
plate. 

Bead Emulsion PCR Amplification 

In a preferred embodiment, a PCR amplification step is performed prior to 
10 distribution of the nucleic acid templates onto the picotiter plate. 

In a particularly preferred embodiment, a novel amplification system, herein 
termed "bead emulsion amplification" is performed by attaching a template nucleic acid 
(e.g., DNA) to be amplified to a solid support, preferably in the form of a generally 
spherical bead. A library of single stranded template DNA prepared according to the 
15 sample prepration methods of this invention is an example of one suitable source of the 
starting nucleic acid template library to be attached to a bead for use in this amplification 
method. 

The bead is linked to a large number of a single primer species (i.e., primer B in 
Figure 6) that is complementary to a region of the template DNA. Template DNA 

20 annealed to the bead bound primer. The beads are suspended in aqueous reaction mixture 
and then encapsulated in a water-in-oil emulsion. The emulsion is composed of discrete 
aqueous phase microdroplets, approximately 60 to 200 um in diameter, enclosed by a 
thermostable oil phase. Each microdroplet contains, preferably, amplification reaction 
solution (i.e., the reagents necessary for nucleic acid amplification). An example of an 

25 amplification would be a PCR reaction mix (polymerase, salts, dNTPs) and a pair of PCR 
primers (primer A and primer B). See, Figure 6A. A subset of the microdroplet 
population also contains the DNA bead comprising the DNA template. This subset of 
microdroplet is the basis for the amplification. The microcapsules that are not within this 
subset have no template DNA and will not participate in amplification. In one 

30 embodiment, the amplification technique is PCR and the PCR primers are present in a 8:1 
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or 16:1 ratio (i.e., 8 or 16 of one primer to 1 of the second primer) to perform asymmetric 
PCR. 

In this overview, the DNA is annealed to an oligonucleotide (primer B) which is 
immobilized to a bead. During thermocycling (Figure 6B), the bond between the single 
5 stranded DNA template and the immobilized B primer on the bead is broken, releasing 
the template into the surrounding microencapsulated solution. The amplification 
solution, in this case, the PCR solution, contains addition solution phase primer A and 
primer B. Solution phase B primers readily bind to the complementary b' region of the 
template as binding kinetics are more rapid for solution phase primers than for 
10 immobilized primers. In early phase PCR, both A and B strands amplify equally well 
(Figure 6C). 

By midphase PCR (i.e.. between cycles 10 and 30) the B primers are depleted, 
halting exponential amplification. The reaction then enters asymmetric amplification and 
the amplicon population becomes dominated by A strands (Figure 6D). In late phase 

1 5 PCR (Figure 6E), after 30 to 40 cycles, asymmetric amplification increases the 
concentration of A strands in solution. Excess A strands begin to anneal to bead 
immobilized B primers. Thermostable polymerases then utilize the A strand as a 
template to synthesize an immobilized, bead bound B strand of the amplicon. 

In final phase PCR (Figure 6F), continued thermal cycling forces additional 

20 annealing to bead bound primers. Solution phase amplification may be minimal at this 
stage but concentration of immobilized B strands increase. Then, the emulsion is broken 
and the immobilized product is rendered single stranded by denaturing (by heat, pH etc.) 
which removes the complimentary A strand. The A primers are annealed to the A' region 
of immobilized strand, and immobilized strand is loaded with sequencing enzymes, and 

25 any necessary accessory proteins. The beads are then sequenced using recognized 
pyrophosphate techniques (described, e.g., in US patent 6,274,320 ? 6258,568 and 
6,210,891, incorporated in toto herein by reference). 
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Template design 

In a preferred embodiment, the DNA template to be amplified by bead emulsion 
amplification can be a population of DNA such as, for example, a genomic DNA library 
5 or a cDNA library. It is preferred that each member of the population have a common 
nucleic acid sequence at the first end and a common nucleic acid sequence at a second 
end. This can be accomplished, for example, by ligating a first adaptor DNA sequence to 
one end and a second adaptor DNA sequence to a second end of the DNA population. 
Many DNA and cDNA libraries, by nature of the cloning vector (e.g., Bluescript, 

1 0 Stratagene, La Jolla, C A) fit this description of having a common sequence at a first end 
and a second common sequence at a second end of each member DNA. The DNA 
template may be of any size amenable to in vitro amplification (including the preferred 
amplification techniques of PCR and asymmetric PCR). In a preferred embodiment, the 
DNA template is between about 150 to 750 bp in size, such as, for example about 250 bp 

15 in size. 

Binding Nucleic Acid Template to Capture Beads 

In a first step, a single stranded nucleic acid template to be amplified is attached 
to a capture bead. The nucleic acid template may be attached to the solid support capture 

20 bead in any manner known in the art. Numerous methods exist in the art for attaching 
DNA to a solid support such as the preferred microscopic bead. According to the present 
invention, covalent chemical attachment of the DNA to the bead can be accomplished by 
using standard coupling agents, such as water-soluble carbodiimide, to link the 5'- 
phosphate on the DNA to amine-coated capture beads through a phosphoamidate bond. 

25 Another alternative is to first couple specific oligonucleotide linkers to the bead using 
similar chemistry, and to then use DNA ligase to link the DNA to the linker on the bead. 
Other linkage chemistries to join the oligonucleotide to the beads include the use of N- 
hydroxysuccinamide (NHS) and its derivatives. In such a method, one end of the 
oligonucleotide may contain a reactive group (such as an amide group) which forms a 

30 covalent bond with the solid support, while the other end of the linker contains a second 
reactive group that can bond with the oligonucleotide to be immobilized. In a preferred 
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embodiment, the oligonucleotide is bound to the DNA capture bead by covalent linkage. 
However, non-covalent linkages, such as chelation or antigen-antibody complexes, may 
also be used to join the oligonucleotide to the bead. 

Oligonucleotide linkers can be employed which specifically hybridize to unique 
5 sequences at the end of the DNA fragment, such as the overlapping end from a restriction 
enzyme site or the "sticky ends' 1 of bacteriophage lambda based cloning vectors, but 
blunt-end ligations can also be used beneficially. These methods are described in detail 
in US 5,674,743. It is preferred that any method used to immobilize the beads will 
continue to bind the immobilized oligonucleotide throughout the steps in the methods of 
10 the invention. 

In one embodiment, each capture bead is designed to have a plurality of nucleic 
acid primers that recognize (i.e., are complementary to) a portion of the nucleic template, 
and the nucleic acid template is thus hybridized to the capture bead. In the methods 
described herein, clonal amplification of the template species is desired, so it is preferred 

15 that only one unique nucleic acid template is attached to any one capture bead. 

The beads used herein may be of any convenient size and fabricated from any 
number of known materials. Example of such materials include: inorganics, natural 
polymers, and synthetic polymers. Specific examples of these materials include: 
cellulose, cellulose derivatives, acrylic resins, glass, silica gels, polystyrene, gelatin, 

20 polyvinyl pyrrolidone, co-polymers of vinyl and acrylamide, polystyrene cross-linked 
with divinylbenzene or the like (as described, e.g., in Merrifield, Biochemistry 1964, 3, 
1385-1390), polyacrylamides, latex gels, polystyrene, dextran, rubber, silicon, plastics, 
nitrocellulose, natural sponges, silica gels, control pore glass, metals, cross-linked 
dextrans (e.g., Sephadex™) agarose gel (Sepharose™), and solid phase supports known 

25 to those of skill in the art. In a preferred embodiment, the capture beads are Sepharose 
beads approximately 25 to 40 urn in diameter. 



Emulsification 

Capture beads with attached single strand template nucleic acid are emulsified as 
30 a heat stable water-in-oil emulsion. The emulsion may be formed according to any 

suitable method known in the art. One method of creating emulsion is described below 
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but any method for making an emulsion may be used. These methods are known in the 
art and include adjuvant methods, counterflow methods, crosscurrent methods, rotating 
drum methods, and membrane methods. Furthermore, the size of the microcapsules may 
be adjusted by varying the flow rate and speed of the components. For example, in 
5 dropwise addition, the size of the drops and the total time of delivery may be varied. 
Preferably, the emulsion contains a density of bead "microreactors" at a density of about 
3,000 beads per microliter. 

The emulsion is preferably generated by suspending the template-attached beads 
in amplification solution. As used herein, the term "amplification solution" means the 

10 sufficient mixture of reagents that is necessary to perform amplification of template 
DNA. One example of an amplification solution, a PCR amplification solution, is 
provided in the Examples below - it will be appreciated that various modifications may 
be made to the PCR solution. 

In one embodiment, the bead / amplification solution mixture is added dropwise 

15 into a spinning mixture of biocompatible oil (e.g., light mineral oil, Sigma) and allowed 
to emulsify. The oil used may be supplemented with one or more biocompatible 
emulsion stabilizers. These emulsion stabilizers may include Atlox 4912, Span 80, and 
other recognized and commercially available suitable stabilizers. Preferably, the droplets 
formed range in size from 5 micron to 500 microns, more preferably, from between about 

20 50 to 300 microns, and most preferably, from 100 to 150 microns. 

There is no limitation in the size of the microreactors. The microreactors should 
be sufficiently large to encompass sufficient amplification reagents for the degree of 
amplification required. However, the microreactors should be sufficiently small so that a 
population of microreactors, each containing a member of a DNA library, can be 

25 amplified by conventional laboratory equipment (e.g., PCR thennocycling equipment, 
test tubes, incubators and the like). 

With the limitations described above, the optimal size of a microreactor may be 
between 100 to 200 microns in diameter. Microreactors of this size would allow 
amplification of a DNA library comprising about 600,000 members in a suspension of 

30 microreactors of less than 10 ml in volume. For example, if PCR was the chosen 

amplification method, 10 mis would fit in 96 tubes of a regular thermocycler with 96 rube 
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capacity. In a preferred embodiment, the suspension of 600,000 microreactors would 
have a volume of less than 1 ml. A suspension of less than 1 ml may be amplified in 
about 10 tubes of a conventional PCR thermocycler. In a most preferred embodiment, 
the suspension of 600,000 microreactors would have a volume of less than 0.5 ml. 

5 

Amplification 

After encapsulation, the template nucleic acid may be amplified by any suitable 
method of DNA amplification including transcription-based amplification systems 
(Kwoh D. et al., Proc. Natl. Acad Sci. (U.S.A.) 86: 1 1 73 (1 989); Gingeras T. R. el al., 

1 0 PCT appl. WO 88/1 03 15; Davey, C. et al., European Patent Application Publication No. 
329,822; Miller, H. I. et al., PCT appl. WO 89/06700, and "race" (Frohman, M. A, In: 
PCR Protocols: A Guide to Methods and Applications, Academic Press, NY (1990)) and 
"one-sided PCR" (Ohara, O. et al., Proc. Natl. Acad. Sci. (U.S.A.) 86.5673-5677 (1989)). 
Still other less common methods such as "di-oligonucleotide" amplification, isothermal 

15 amplification (Walker, G. T. et al., Proc. Natl. Acad. Sci. (U.S.A.) 89:392-396 (1992)), 
and rolling circle amplification (reviewed in 5,714320), may be used in the present 
invention. 

In a preferred embodiment, DNA amplification is performed by PCR. PCR 
according to the present invention may be performed by encapsulating the target nucleic 

20 acid, bound to a bead, with a PCR solution comprising all the necessary reagents for 
PCR. Then, PCR may be accomplished by exposing the emulsion to any suitable 
thermocycling regimen known in the art. In a preferred embodiment, between 30 and 50 
cycles, preferably about 40 cycles, of amplification are performed. It is desirable, but not 
necessary, that following the amplification procedure there be one or more hybridization 

25 and extension cycles following the cycles of amplification. In a preferred embodiment, 
between 10 and 30 cycles, preferably about 25 cycles, of hybridization and extension are 
performed (e.g., as described in the examples). Routinely, the template DNA is amplified 
until typically at least two million to fifty million copies, preferably about ten million to 
thirty million copies of the template DNA are immobilized per bead. 

30 
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Breaking the Emulsion and Bead Recovery 

Following amplification of the template, the emulsion is "broken" (also referred 
to as "demulsification" in the art). There are many methods of breaking an emulsion 
5 (see, e.g., U.S. Patent 5,989,892 and references cited therein) and one of skill in the art 
would be able to select the proper method. In the present invention, one preferred 
method of breaking the emulsion is to add additional oil to cause the emulsion to separate 
into two phases. The oil phase is then removed, and a suitable organic solvent (e.g., 
hexanes) is added. After mixing, the oil/organic solvent phase is removed. This step 
10 may be repeated several times. Finally, the aqueous layers above the beads are removed. 
The beads are then washed with an organic solvent / annealing buffer mixture (e.g., one 
suitable annealing buffer is described in the examples), and then washed again in 
annealing buffer. Suitable organic solvents include alcohols such as methanol, ethanol 
and the like. 

15 The amplified template-containing beads may then be resuspended in aqueous 

solution for use, for example, in a sequencing reaction according to known technologies. 
{See, Sanger, F. et al, Proc. Natl. Acad. Sci. U.S.A. 75, 5463-5467 (1977); Maxam, A. 
M. & Gilbert, W. Proc Natl Acad Sci USA 74, 560-564 (1977); Ronaghi, M. et ah, 
Science 281, 363, 365 (1998); Lysov, I. et al., Dokl Akad Nauk SSSR 303, 1508-151 1 

20 (1988); Bains W. & Smith G. C. J.TheorBiol 135, 303-307(1988); Drnanac, R. et al. 
Genomics 4, 1 14-128 (1989); Khrapko, K. R. et al., FEBS Lett 256. 1 18-122 (1989); 
Pevzner P. A. J Biomol Struct Dyn 7, 63-73 (1989); Southern, E. M. et al. Genomics 13, 
1008-1017 (1992).) If the beads are to be used in a pyrophosphate-based sequencing 
reaction (described, e.g, in US patent 6,274,320, 6258,568 and 6,210,891, and 

25 incorporated in toto herein by reference), then it is necessary to remove the second strand 
of the PCR product and anneal a sequencing primer to the single stranded template that is 
bound to the bead. 

Briefly, the second strand is melted away using any number of commonly known 
methods such as NaOH, low ionic {e.g., salt) strength, or heat processing. Following this 
30 melting step, the beads are pelleted and the supernatant is discarded. The beads are 
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resuspended in an annealing buffer, the sequencing primer added, and annealed to the 
bead-attached single stranded template using a standard annealing cycle. 



Purifying the beads 

5 At this point, the amplified DNA on the bead may be sequenced either directly on 

the bead or in a different reaction vessel. In an embodiment of the present invention, the 
DNA is sequenced directly on the bead by transferring the bead to a reaction vessel and 
subjecting the DNA to a sequencing reaction (e.g., pyrophosphate or Sanger sequencing). 
Alternatively, the beads may be isolated and the DNA may be removed from each bead 

10 and sequenced. In either case, the sequencing steps may be performed on each individual 
bead. However, this method, while commercially viable and technically feasible, may 
not be most effective because many of the beads will be negative beads (a bead that does 
not have amplified DNA attached). Accordingly, the following optional process may be 
used for removing beads that contain no nucleic acid template prior to distribution onto 

15 the picotiter plate. 

A high percentage of the beads may be "negative" (i.e. ? have no amplified nucleic 
acid template attached thereto) if the goal of the initial DNA attachment is to minimize 
beads with two different copies of DNA. For useful pyrophosphate sequencing, each 
bead should contain multiple copies of a single species of DNA. This requirement is 

20 most closely approached by maximizing the total number of beads with a single fragment 
of DNA bound (before amplification). This goal can be achieved by the observation of a 
mathematical model. 

For the general case of "N" number of DNAs randomly distributed among M 
number of beads, the relative bead population containing any number of DNAs depends 

25 on the ratio of N/M. The fraction of beads containing N DNAs R(N) may be calculated 
using the Poisson distribution: 

R(N) = exp-(N/M) X (N/M) N /N! (where X is the multiplication symbol) 
The table below shows some calculated values for various N/M (the average DNA 
fragment to bead ratio) and N (the number of fragments actually bound to a bead). 

30 
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N/M 


0.1 


0.5 


1 


2 


R(0) 


0.9 


0.61 


0.37 


0.13 


R(l) 


0.09 


0.3 


0.37 


0.27 


R(N>1) 


0.005 


0.09 


0.26 


0.59 



In the table the top row denotes the various ratios of N/M. R(0) denotes the 
fraction of beads with no DNA, R(l) denotes the fraction of beads with one DNA 

5 attached (before amplification) and R(N>1 ) denotes the fraction of DNA with more than 
one DNA attached (before amplification). 

The table indicates that the maximum fraction of beads containing a single DNA 
fragment is 0.37 (37%) and occurs at a fragment to bead ratio of one. In this mixture, 
about 63% of the beads is useless for sequencing because they have either no DNA or 

10 more than a single species of DNA. Additionally, controlling the fragment to bead ratio 
require complex calculations and variability could produce bead batches with a 
significantly smaller fraction of useable beads. 

This inefficiency could be significantly ameliorated if beads containing amplicon 
(originating from the binding of at least one fragment) could be separated from those 

15 without amplicon (originating from beads with no bound fragments). An amplicon is 
defined as any nucleic acid molecules produced by an in vitro nucleic amplification 
technique. Binding would be done at low average fragment-to-bead ratios (N/M <1), 
minimizing the ratio of beads with more than one DNA bound. A separation step would 
remove most or all of the beads with no DNA leaving an enriched population of beads 

20 with one species of amplified DNA. These beads may be applied to any method of 
sequencing such as, for example, pyrophosphate sequencing. Because the fraction of 
beads with one amplicon (N=l ) has been enriched, any method of sequencing would be 
more efficient. 

As an example, with an average fragment to bead ratio of 0.1, 90% of the beads 
25 will have no amplicon, 9% of the beads would be usefiil with one amplicon, and 0.5% of 
the beads will have more than one amplicon. An enrichment process of the invention will 
remove the 90% of the zero amplicon beads leaving a population of beads where the 
sequenceable fraction (N=l) is: 

39 
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1-(0.005/().09)=94%. 

Dilution of the fragment to bead mixture, along with separation of beads 
containing amplicon can yield an enrichment of 2.5 folds over the optimal unenriched 
method. 94%/37% (see table above N/M=l) = 2.5. An additional benefit of the 
5 enrichment procedure of the invention is that the ultimate fraction of sequenceable beads 
is relatively insensitive to variability in N/M. Thus, complex calculations to derive the 
optimal N/M ratio are either unnecessary or may be performed to a lower level of 
precision. This will ultimately make the procedure more suitable to performance by less 
trained personnel or automation. An additional benefit of the procedure is that the zero 

10 amplicon beads may be recycled and reused. While recycling is not necessary, it may 
reduce cost or the total bulk of reagents making the method of the invention more 
suitable for some purposes such as, for example, portable sampling, remote robotic 
sampling and the like. In addition, all the benefits of the procedure (i.e., less trained 
personnel, automation, recycling of reagents) will reduce the cost of the procedure. The 

15 procedure is described in more detail below. 

The enrichment procedure may be used to treat beads that have been amplified in 
the bead emulsion method above. The amplification is designed so that each amplified 
molecule contains the same DNA sequence at its 3' end. The nucleotide sequence may 
be a 20 mer but may be any sequence from 1 5 bases or more such as 25 bases, 30 bases, 

20 35 bases, or 40 bases or longer. Naturally, while longer oligonucleotide ends are 

functional, they are not necessary. This DNA sequence may be introduced at the end of 
an amplified DNA by one of skill in the art. For example, if PCR is used for 
amplification of the DNA, the sequence may be part of one member of the PCR primer 
pair. 

25 A schematic of the enrichment process is illustrated in Figure 7. Here, the 

amplicon-bound bead mixed with 4 empty beads represents the fragment-diluted 
amplification bead mixture. In step 1, a biotinylated primer complementary to the 3' end 
of the amplicon is annealed to the amplicon. In step 2, DNA polymerase and the four 
natural deoxynucleolidcs triphosphates (dNTPs) are added to the bead mix and the 

30 biotinylated primer is extended. This extension is to enhance the bonding between the 
biotinylated primer and the bead-bound DNA. This step may be omitted if the 
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biotinylatcd primer - DNA bond is strong (e.g., in a high ionic environment). In Slep 3, 
streptavidin coated beads susceptible to attraction by a magnetic field (referred to herein 
as "magnetic streptavidin beads") are introduced to the bead mixtures. Magnetic beads 
are commercially available, for example, from Dynal (M290). The streptavidin capture 

5 moieties binds biotins hybridized to the amplicons, which then specifically fix the 
ampl icon-bound beads to the magnetic streptavidin beads. 

In step 5, a magnetic field (represented by a magnet) is applied near the reaction 
mixture, which causes all the "magnetic streptavidin beads/amplicon bound bead 
complexes" to be positioned along one side of the tube most proximal to the magnetic 

10 field. Magnetic beads without amplicon bound beads attached are also expected to be 
positioned along the same side. Beads without amplicons remain in solution. The bead 
mixture is washed and the beads not immobilized by the magnet (i.e., the empty beads) 
are removed and discarded. In step 6, the extended biotinylatcd primer strand is 
separated from the amplicon strand by "melting" - a step that can be accomplished, for 

1 5 example, by heat or a change in pH. The heat may be 60°C in low salt conditions (e.g., in 
a low ionic environment such as 0.1X SSC). The change in pH may be accomplished by 
the addition of NaOH. The mixture is then washed and the supernatant, containing the 
amplicon bound beads, is recovered while the now unbound magnetic beads are retained 
by a magnetic field. The resultant enriched beads may be used for DNA sequencing. It 

20 is noted that the primer on the DNA capture bead may be the same as the primer of step 2 
above. In this case, annealing of the amplicon-primer complementary strands (with or 
without extension) is the source of target-capture affinity. 

The biotin streptavidin pair could be replaced by a variety of capture-target pairs. 
Two categories are pairs whose binding can be subsequently cleaved and those which 

25 bind irreversibly, under conditions that are practically achievable. Cleavable pairs 
include thiol-thiol, Digoxigenin/ anti- Digoxigenin, -Captavidin™ if cleavage of the 
target-capture complex is desired. 

As described above, step 2 is optional. If step 2 is omitted, it may not be 
necessary to separate the magnetic beads from the amplicon bound beads. The amplicon 

30 bound beads, with the magnetic beads attached, may be used directly for sequencing. If 
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the sequencing were to be performed in a microwell, separation would not be necessary if 
the amplicon bound bead-magnetic bead complex can fit inside the microwell. 

While the use of magnetic capture beads is convenient, capture moieties can be 
bound to other surfaces. For example, streptavidin could be chemically bound to a 
5 surface, such as, the inner surface of a tube. In this case, the amplified bead mixture may 
be flowed through. The amplicon bound beads will tend to be retained until "melting" 
while the empty beads will flow through. This arrangement may be particularly 
advantageous for automating the bead preparation process. 

While the embodiments described above is particularly useful, other methods can 

10 be envisioned to separate beads. For example, the capture beads may be labeled with a 
fluorescent moiety which would make the target-capture bead complex fluorescent. The 
target capture bead complex may be separated by flow cytometry or fluorescence cell 
sorter. Using large capture beads would allow separation by filtering or other particle 
size separation techniques. Since both capture and target beads are capable of forming 

15 complexes with a number of other beads, it is possible to agglutinate a mass of cross- 
linked capture-target beads. The large size of the agglutinated mass would make 
separation possible by simply washing away the unagglutinated empty beads. The 
methods described are described in more detail, for example, in Bauer, J.; J. 
Chromatography B, 722 (1999) 55-69 and in Brody et al., Applied Physics Lett. 74 

20 (1999) 144-146. 

The DNA capture beads each containing multiple copies of a single species of 
nucleic acid template prepared according to the above method are then suitable for 
distribution onto the picotiter plate. 

25 Nucleic Acid Amplification On The Picotiter Plate 

In an alternative embodiment, the nucleic acid template is distributed onto the 
picotiter plate prior to amplification and then amplified in situ on the picotiter plate. This 
method is described in detail in the Examples. 
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3. SEQUENCING THE NUCLEIC ACID TEMPLATE 

Pyrophosphate sequencing is used according to the methods of this invention to 
sequence the nucleic acid template. This technique is based on the detection of released 
5 pyrophosphate (Ppi) during DNA synthesis. See, e.g., Hyman, 1988. A new method of 
sequencing DNA. Anal Biochem. 174:423-36; Ronaghi, 2001. Pyroscquencing sheds 
light on DNA sequencing. Genome Res. 1 1:3-1 1. 

In a cascade of enzymatic reactions, visible light is generated proportional to the 
number of incoiporated nucleotides. The cascade starts with a nucleic acid 

10 polymerization reaction in which inorganic Ppi is released with nucleotide incorporation 
by polymerase. The released Ppi is converted to ATP by ATP sulfurylase, which 
provides the energy to luciferase to oxidize luciferin and generates light. Because the 
added nucleotide is known, the sequence of the template can be determined. Solid-phase 
pyrophosphate sequencing utilizes immobilized DNA in a three-enzyme system (see 

15 Figures). To increase the signal-to-noise ratio, the natural dATP has been replaced by 
dATPaS. Typically dATPctS is a mixture of two isomers (Sp and Rp): the use of pure 2'- 
deoxyadcnosinc-5 -O - ( 1 -thiotriphosphate) Sp-isomer in pyrophosphate sequencing 
allows substantially longer reads, up to doubling of the read length. 

20 4. APPARATUS FOR SEQUENCING NUCLEIC ACIDS 

This invention provides an apparatus for sequencing nucleic acids, which 
generally comprises one or more reaction chambers for conducting a sequencing reaction, 
means for delivering reactants to and from the reaction chamber(s), and means for 
detecting a sequencing reaction event. In another embodiment, the apparatus includes a 

25 reagent delivery cuvette containing a plurality of cavities on a planar surface. In a 

preferred embodiment, the apparatus is connected to at least one computer for controlling 
the individual components of the apparatus and for storing and/or analyzing the 
information obtained from detection of the sequence reaction event. 

The invention also provides one or more reaction chambers arc arranged on an 

30 inert substrate material, also referred to herein as a "solid support", that allows for 
discrete localization of the nucleic acid template and of the reactants in a sequencing 
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reaction in a defined space, as well as for detection of the sequencing reaction event. 

Thus, as used herein, the terms ''reaction chamber" or "analyte reaction chamber" refer to 

a localized area on the substrate material that facilitates interaction of reactants, e.g., in a 

nucleic acid sequencing reaction. As discussed more fully below, the sequencing 
5 reactions contemplated by the invention preferably occur on numerous individual nucleic 

acid samples in tandem, in particular simultaneously sequencing numerous nucleic acid 

samples derived from genomic and chromosomal nucleic acid templates (e.g., DNA). 

The apparatus of the invention therefore preferably comprises a sufficient number 

of reaction chambers to carry out such numerous individual sequencing reactions. In one 
10 embodiment, there are at least 10,000 reaction chambers, preferably at least 50,000 

reaction chambers, more preferably greater than 100,000 reaction chambers, even more 

preferably greater than 200,000 reaction chambers. 

Since the number of simultaneous sequencing reactions is limited by the number 

of reaction chambers, the throughput may be increased by fabricating plates containing 
15 increasing densities of wells. The table below shows this progression for a 14 x 43 mm 

and 30 x 60 mm active areas, derived from 25 x 75 mm and 40 x 75 mm arrays, 

respectively. 

Table: Development of higher well count arrays* 



Pitch 


Well 


# of Wells 


# of Wells 


(urn) 


Diameter (urn) 


(14 x 43 mm) 


(30 x 60 mm) 


50 


44 


275K 


800K 


43 


38 


375K 


1.2M 


35 


31 


575K 


1.6M 


25 


22 


1.1M 


3.2M 



The reaction chambers on the array typically take the form of a cavity or well in 
the substrate material, having a width and depth, into which reactants can be deposited. 
Typically the nucleic acid template is distributed into the reaction chamber on one or 
30 more solid supports or beads; the reactants are in a medium which facilitates the reaction 
and which flows through the reaction chamber. When formed as cavities or wells, the 
chambers are preferably of sufficient dimension and order to allow for (i) the introduction 
of the necessary reactants into the chambers, (ii) reactions to take place within the 
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chamber and (iii) inhibition of mixing of reactants between chambers. The shape of the 
well or cavity is preferably circular or cylindrical, but can be multisided so as to 
approximate a circular or cylindrical shape. In one preferred embodiment, the shape of 
the well or cavity is substantially hexagonal. The cavity can have a smooth wall surface. 

5 In an additional embodiment, the cavity can have at least one irregular wall surface. The 
cavities can have a planar bottom or a concave bottom. 

The reaction chambers can be spaced between 5|am and 200^im apart. Spacing is 
determined by measuring the center-to-center distance between two adjacent reaction 
chambers. Typically, the reaction chambers can be spaced between lO^tm and 1 50jim 

10 apart, preferably between 20 jxm and 100 jim apart, most preferably between 40 and 60 
tun apart. In one embodiment, the reaction chambers have a width (diameter) in one 
dimension of between 0.3 ^m and 1 00 jam, more preferably between 20 urn and 70 jam 
and most preferably about 30 and 50 \xm. The depth of the reaction chambers are 
preferably between 10 jam and 100 jim, preferably between 20 \xm and 60 (am. 

15 Alternatively, the reaction chambers may have a depth that is between 0.25 and 5 times 
the width in one dimension of the reaction chamber or, in another embodiment, between 
0.3 and I times the width in one dimension of the reaction chamber. 

In a preferred embodiment, the array is fashioned from a sliced fiber optic bundle 
(i.e., a bundle of fused fiber optic cables) and the reaction chambers are formed by 

20 etching one surface of the fiber optic reactor array. The cavities can also be formed in the 
substrate via etching, molding or micromachining. 

Each cavity or reaction chamber typically has a depth of between 10 ^ra and 100 
jim; alternatively, the depth is between 0.25 and 5 times the size of the width of the 
cavity, preferably between 0.3 and 1 times the size of the width of the cavity. 

25 In one embodiment, the arrays described herein typically include a planar top 

surface and a planar bottom surface, which is optically conductive such that optical 
signals from the reaction chambers can be detected through the bottom planar surface. In 
these arrays, typically the distance between the top surface and the bottom surface is no 
greater than 10 cm, preferably no greater than 2 cm, and usually between 0.5 mm to 

30 5 mm, most preferably about 2 mm. 
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In a particularly preferred embodiment, the solid support is termed a 
picotiterplatc, with reaction chambers having a center to center spacing of about 43 |im 
to 50 |am, a well diameter of between 38 \xm to 44 jam, and a well volume of between 
10 to 1 50 pL, preferably between 20 to 90 pL, more preferably between 40 to 85 pL, and 
5 most preferably about 75 pL. 

In one embodiment, each cavity or reaction chamber of the array contains 
reagents for analyzing a nucleic acid or protein. Typically those reaction chambers that 
contain a nucleic acid (not all reaction chambers in the array are required to) contain only 
a single species of nucleic acid (i.e., a single sequence that is of interest). There may be a 

10 single copy of this species of nucleic acid in any particular reaction chamber, or there 
may be multiple copies. It is generally preferred that a reaction chamber contain at least 
100,000 copies of the nucleic acid template sequence, preferably at least 1,000,000 
copies, and more preferably between 2,000,000 to 20,000,000 copies, and most 
preferably between 5,000,000 to 1 5,000,000 copies of the nucleic acid. The ordinarily 

15 skilled artisan will appreciate that changes in the number of copies of a nucleic acid 
species in any one reaction chamber will affect the number of photons generated in a 
pyrosequencing reaction, and can be routinely adjusted to provide more or less photon 
signal as is required. In one embodiment the nucleic acid species is amplified to provide 
the desired number of copies using PCR, RCA, ligase chain reaction, other isothermal 

20 amplification, or other conventional means of nucleic acid amplification. In one 
embodiment, the nucleic acid is single stranded. 

Solid Support Material 

Any material can be used as the solid support material, as long as the surface 
25 allows for stable attachment of the primers and detection of nucleic acid sequences. The 
solid support material can be planar or can be cavitated, e.g., in a cavitated terminus of a 
fiber optic or in a microwell etched, molded, or otherwise micromachined into the planar 
surface, e.g. using techniques commonly used in the construction of 
microelectromechanical systems. See e.g., Rai-Choudhury, Handbook of 

30 MlCROUTHOGRAPHY, M ICROM ACH1N1NG, AND MlCROFABRICATION, VOLUME 1: 

Microlithograpi iy, Volume PM39, SPIE Press (1997); Madou, CRC Press (1997), 
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Aoki, Biotech. Histochem. 67: 98-9 (1 992); Kane et aL, Biomaterials. 20: 2363-76 

(1999) ; Deng et aL, Anal. Chem. 72:3176-80 (2000); Zhu et aL, Nat. Genet. 26:283-9 

(2000) . In some embodiments, the solid support is optically transparent, e.g., glass. 
An array of attachment sites on an optically transparent solid support can be 

5 constructed using lithographic techniques commonly used in the construction of 

electronic integrated circuits as described in, e.g., techniques for attachment described in 
U.S. Patent Nos. 5,143,854, 5,445,934, 5,744,305, and 5,800,992; Chee et aL, Science 
274: 610-614 (1996); Fodor et aL, Nature 364: 555-556 (1993); Fodor et aL, Science 
251: 767-773 (1991); Gushin, et aL, Anal. Biochem. 250: 203-21 1 (1997); Kinosita et 

10 aL, Cell 93: 21-24 (1998); Kato-Yamada et aL, J. Biol. Chem. 273: 19375-19377 
(1998); and Yasuda et aL 9 Cell 93: 1117-1 124 (1998). Photolithography and electron 
beam lithography sensitize the solid support or substrate with a linking group that allows 
attachment of a modified biomolecule (e.g., proteins or nucleic acids). See e.g., Service, 
Science 283: 27-28 (1999); Rai-Choudhury, Handbook of Microlithography, 

15 MICROMACHINING, AND MlCROFABRICATlON, VOLUME I: MlCROLITHOGRAPHY, Volume 
PM39, SPIE Press (1997). Alternatively, an array of sensitized sites can be generated 
using thin-film technology as described in Zasadzinski et aL, Science 263: 1726-1733 
(1994). 

The substrate material is preferably made of a material that facilitates detection of 
20 the reaction event. For example, in a typical sequencing reaction, binding of a dNTP to a 
sample nucleic acid to be sequenced can be monitored by detection of photons generated 
by enzyme action on phosphate liberated in the sequencing reaction. Thus, having the 
substrate material made of a transparent or light conductive material facilitates detection 
of the photons. 

25 In some embodiments, the solid support can be coupled to a bundle of optical 

fibers that are used to detect and transmit the light product. The total number of optical 
fibers within the bundle may be varied so as to match the number of individual reaction 
chambers in the array utilized in the sequencing reaction. The number of optical fibers 
incorporated into the bundle is designed to match the resolution of a detection device so 

30 as to allow 1 : 1 imaging. The overall sizes of the bundles are chosen so as to optimize the 
usable area of the detection device while maintaining desirable reagent (flow) 
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characteristics in the reaction chamber. Thus, for a 4096 x 4096 pixel CCD (charge- 
coupled device) array with 1 5 \im pixels, the fiber bundle is chosen to be approximately 
60 mm x 60 mm or to have a diameter of approximately 90 ram. The desired number of 
optical fibers are initially fused into a bundle or optical fiber array, the terminus of which 
5 can then be cut and polished so as to form a "wafer'' of the required thickness (e.g.. 

1.5 mm). The resulting optical fiber wafers possess similar handling properties to that of 
a plane of glass. The individual fibers can be any size diameter (e.g., 3|im to 100 jim). 

In some embodiments two fiber optic bundles are used: a first bundle is attached 
directly to the detection device (also referred to herein as the fiber bundle or connector) 

10 and a second bundle is used as the reaction chamber substrate (the wafer or substrate). In 
this case the two are placed in direct contact, optionally with the use of optical coupling 
fluid, in order to image the reaction centers onto the detection device. If a CCD is used 
as the detection device, the wafer could be slightly larger in order to maximize the use of 
the CCD area, or slightly smaller in order to match the format of a typical microscope 

15 s iide — 25 mm x 75 mm. The diameters of the individual fibers within the bundles are 
chosen so as to maximize the probability that a single reaction will be imaged onto a 
single pixel in the detection device, within the constraints of the state of the art. 
Exemplary diameters are 6-8 \im for the fiber bundle and 6-50 |im for the wafer, though 
any diameter in the range 3-100 jam can be used. Fiber bundles can be obtained 

20 commercially from CCD camera manufacturers. For example, the wafer can be obtained 
from Incom, Inc. (Charlton, MA) and cut and polished from a large fusion of fiber optics, 
typically being 2 mm thick, though possibly being 0.5 to 5 mm thick. The wafer has 
handling properties similar to a pane of glass or a glass microscope slide. 

Reaction chambers can be formed in the substrate made from fiber optic material. 

25 The surface of the optical fiber is cavitated by treating the termini of a bundle of fibers. 
e.g., with acid, to form an indentation in the fiber optic material. Thus, in one 
embodiment cavities are formed from a fiber optic bundle, preferably cavities can be 
formed by etching one end of the fiber optic bundle. Each cavitated surface can form a 
reaction chamber. Such arrays are referred to herein as fiber optic reactor arrays or 

30 FORA. The indentation ranges in depth from approximately one-half the diameter of an 
individual optical fiber up to two to three times the diameter of the fiber. Cavities can be 
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introduced into the termini of the fibers by placing one side of the optical fiber wafer into 
an acid bath for a variable amount of time. The amount of time can vary depending upon 
the overall depth of the reaction cavity desired (see e.g., Wall, et al., 1996. Anal. Chan. 
70: 1888). A wide channel cavity can have uniform flow velocity dimensions of 
5 approximately 14mm x 43mm. Thus, with this approximate dimension and at 

approximately 4.82 x 10" 4 cavities/um 2 density, the apparatus can have approximately 
290,000 fluidically accessible cavities. Several methods are known in the art for 
attaching molecules (and detecting the attached molecules) in the cavities etched in the 
ends of fiber optic bundles. See, e.g., Michael, et al. y Anal. Chem. 70: 1242-1248 

10 (1998); Ferguson, et ai, Nature Biotechnology 14: 1681-1684 (1996); Healey and Walt, 
Anal. Chem. 69: 2213-2216 (1997). A pattern of reactive sites can also be created in the 
microwell, using photolithographic techniques similar to those used in the generation of a 
pattern of reaction pads on a planar support. See, Healey, et al. Science 269: 1078-1080 
(1995); Munkholm and Walt, Anal. Chem. 58: 1427-1430 (1986), and Bronk, et al, 

1 5 Anal Chem. 67: 21 '50-27 '57 ( 1 995). 

The opposing side of the optical fiber wafer (i.e., the non-etched side) is typically 
highly polished so as to allow optical-coupling (e.g., by immersion oil or other optical 
coupling fluids) to a second, optical fiber bundle. This second optical fiber bundle 
exactly matches the diameter of the optical wafer containing the reaction chambers, and 

20 serve to act as a conduit for the transmission of light product to the attached detection 
device, such as a CCD imaging system or camera. 

In one preferred embodiment, the fiber optic wafer is thoroughly cleaned, e.g. by 
serial washes in 15% H 2 02/15%NH40H volumervolume in aqueous solution, then six 
deionized water rinses, then 0.5M EDTA, then six deionized water, then 15% 

25 H202/15%NH40H, then six deionized water (one-half hour incubations in each wash). 

The surface of the fiber optic wafer is preferably coated to facilitate its use in the 
sequencing reactions. A coated surface is preferably optically transparent, allows for 
easy attachment of proteins and nucleic acids, and does not negatively affect the activity 
of immobilized proteins. In addition, the surface preferably minimizes non-specific 

30 absorption of macromolecules and increases the stability of linked macromolecules (e.g., 
attached nucleic acids and proteins). 
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Suitable materials for coating the array include, e.g.* plastic (e.g. polystyrene). 
The plastic can be preferably spin-coated or sputtered (0.1 |im thickness). Other 
materials for coating the array include gold layers, e.g. 24 karat gold, 0.1 jam thickness, 
with adsorbed self-assembling monolayers of long chain thiol alkanes. Biotin is then 
5 coupled covalently to the surface and saturated with a biotin-binding protein (e.g. 
streptavidin or avidin). 

Coating materials can additionally include those systems used to attach an anchor 
primer to a substrate. Organosilane reagents, which allow for direct covalent coupling of 
proteins via amino, sulfhydryl or carboxyl groups, can also be used to coat the array. 
10 Additional coating substances include photoreactive linkers, e.g. photobiotin, (Amos et 
ah, "Biomaterial Surface Modification Using Photochemical Coupling Technology," in 
Encyclopedic Handbook of Biomaterials and Bioengineering, Part A: Materials* Wise et 
al. (eds.). New York, Marcel Dekkcr, pp. 895926, 1995). 

Additional coating materials include hydrophilic polymer gels (polyacrylamide, 
15 polysaccharides), which preferably polymerize directly on the surface or polymer chains 
covalently attached post polymerization (Hjerten,/. Chroma togr. 347,191 (1985); 
Novotny, Anal. Chem. 62,2478 (1990), as well as pluronic polymers (triblock 
copolymers, e.g. PPO-PEO-PPO, also known as F-108), specifically adsorbed to either 
polystyrene or silanized glass surfaces (Ho et al., Langmuir 14:3889-94, 1998). as well 
20 as passively adsorbed layers of biotin-binding proteins. The surface can also be coated 
with an epoxide which allows the coupling of reagents via an amine linkage. 

In addition, any of the above materials can be derivatized with one or more 
functional groups, commonly known in the art for the immobilization of enzymes and 
nucleotides, e.g. metal chelating groups (e.g. nitrilo triacetic acid, iminodiacetic acid r 
25 pentadenlate chelator), which will bind 6xHis-tagged proteins and nucleic acids. 

Surface coatings can be used that increase the number of available binding sites 
for subsequent treatments, e.g. attachment of enzymes (discussed later), beyond the 
theoretical binding capacity of a 2D surface. 

In a preferred embodiment, the individual optical fibers utilized to generate the 
30 fused optical fiber bundle/wafer are larger in diameter (i.e.. 6 |.im to 1 2 pm) than those 
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utilized in the optical imaging system (i.e., 3 ^m). Thus, several of the optical imaging 
fibers can be utilized to image a single reaction site. 

In a particularly preferred embodiment, the sample cartridge for nucleic acid 
templete sequencing, termed the PicoTiter plate* is formed from a commercial fiber 
5 optics faceplate, acid-etched to yield well structures. Each optic fiber core is about 44 
microns in diameter, with a 2-3 micron cladding, each well formed by acid etching to 
form a reaction well volume of about 65 pL to 85 pL, most preferably about 75 pL. The 
use of etched wells on a fiber optics faceplate surface serves a threefold purpose; i) 
delayed diffusion of the luminescence from emitting light in a different region of the 
1 0 array, ii) isolation of reaction chambers ("test-tubes' ) that contain the amplified template 
molecules, and iii) very efficient, high numerical aperture optical coupling to the CCD. 
Finally, the larger the amount of sequencing template immobilized within a well, the 
more optical signal one is able to achieve. 

15 Delivery Means 

An example of the means for delivering reactants to the reaction chamber is the 
perfusion chamber of the present invention is illustrated in Figure 13. The perfusion 
chamber includes a sealed compartment with transparent upper and lower side. It is 
designed to allow flow of solution over the surface of the substrate surface and to allow 

20 for fast exchange of reagents. Thus, it is suitable for carrying out, for example, the 

pyrophosphate sequencing reactions. The shape and dimensions of the chamber can be 
adjusted to optimize reagent exchange to include bulk flow exchange, diffusive 
exchange, or both in either a laminar flow or a turbulent flow regime. 

The perfusion chamber is preferably detached from the imaging system while it is 

25 being prepared and only placed on the imaging system when sequencing analysis is 

perfonned. In one embodiment, the solid support (/.<?., a DNA chip or glass slide) is held 
in place by a metal or plastic housing, which may be assembled and disassembled to 
allow replacement of said solid support. The lower side of the solid support of the 
perfusion chamber carries the reaction chamber array and, with a traditional optical-based 

30 focal system, a high numerical aperture objective lens is used to focus the image of the 
reaction center array onto the CCD imaging system. 
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Many samples can thereby be analyzed in parallel. Using the method of the 
invention, many nucleic acid templates maybe analyzed in this was by allowing the 
solution containing the enzymes and one nucleotide to flow over the surface and then 
detecting the signal produced for each sample. This procedure can then be repeated. 
5 Alternatively, several different oligonucleotides complementary to the template may be 
distributed over the surface followed by hybridization of the template. Incorporation of 
deoxynucleotides or dideoxynucleotides may be monitored for each oligonucleotide by 
the signal produced using the various oligonucleotides as primer. By combining the 
signals from different areas of the surface, sequence-based analyses may be performed by 

1 0 four cycles of polymerase reactions using the various dideoxynucleotides. 

When the support is in the form of a cavitated array, e.g., in the termini of a 
picotiter plate or other array of microwells, suitable delivery means for reagents include 
flowing and washing and also, e.g., flowing, spraying, electrospraying, ink jet delivery, 
stamping, ultrasonic atomization (Sonotek Corp., Milton, NY) and rolling. When 

15 spraying is used, reagents may be delivered to the picotiter plate in a homogeneous thin 
layer produced by industrial type spraying nozzles (Spraying Systems, Co., Whcaton, 1L) 
or atomizers used in thin layer chromatography (TLC), such as CAMAG TLC Sprayer 
(Camag Scientific Inc., Wilmington, NC). These sprayers atomize reagents into aerosol 
spray particles in the size range of 0.3 to 10 |im. 

20 Successive reagent delivery steps are preferably separated by wash steps using 

techniques commonly known in the art. These washes can be performed, e.g., using the 
above described methods, including high-flow sprayers or by a liquid flow over the 
picotiter plate or microwell array surface. The washes can occur in any time period after 
the starting material has reacted with the reagent to form a product in each reaction 

25 chamber but before the reagent delivered to any one reaction chamber has diffused out of 
that reaction chamber into any other reaction chamber. In one embodiment, any one 
reaction chamber is independent of the product formed in any other reaction chamber, but 
is generated using one or more common reagents. 

An embodiment of a complete apparatus is illustrated in Figure 12. The apparatus 

30 includes an inlet conduit 200 in communication with a detachable perfusion chamber 
226. The inlet conduit 200 allows for entry of sequencing reagents via a plurality of 
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tubes 202-212, which are each in communication with a plurality of sequencing 
dispensing reagent vessels 2 14-224. 

Reagents are introduced through the conduit 200 into the perfusion chamber 226 
using either a pressurized system or pumps to drive positive flow. Typically, the reagent 
5 flow rates are from 0.05 to 50 ml/minute (e.g., 1 to 50 ml/minute) with volumes from 
0.100 ml to continuous flow (for washing). Valves are under computer control to allow 
cycling of nucleotides and wash reagents. Sequencing reagents, e.g., polymerase can be 
either pre-mixed with nucleotides or added in stream. A manifold brings all six tubes 
202-212 together into one for feeding the perfusion chamber. Thus several reagent 
10 delivery ports allow access to the perfusion chamber. For example, one of the ports may 
be utilized to allow the input of the aqueous sequencing reagents, while another port 
allows these reagents (and any reaction products) to be withdrawn from the perfusion 
chamber. 

In a preferred embodiment, one or more reagents are delivered to an array 

15 immobilized or attached to a population of mobile solid supports, e.g., a bead or 

microsphere. The bead or microsphere need not be spherical, irregular shaped beads may 
be used. They are typically constructed from numerous substances, e.g., plastic, glass or 
ceramic and bead sizes ranging from nanometers to millimeters depending on the width 
of the reaction chamber. Various bead chemistries can be used e.g., methylstyrene, 

20 polystyrene, acrylic polymer, latex, paramagnetic, thoria sol, carbon graphite and 

titanium dioxide. The construction or chemistry of the bead can be chosen to facilitate 
the attachment of the desired reagent. 

In another embodiment, the bioactive agents are synthesized first, and then 
covalently attached to the beads. As is appreciated by someone skilled in the art, this will 

25 be done depending on the composition of the bioactive agents and the beads. The 
funclionalization of solid support surfaces such as certain polymers with chemically 
reactive groups such as thiols, amines, carboxyls, etc. is generally known in the art. 

In a preferred embodiment, the nucleic acid template is delivered to the picotiter 
plate on beads. The luciferase and sulfurylase enzymes are likewise delivered to each 

30 well on beads (see Figure), as is the DNA polymerase. It is noted that the one or more of 
the nucleic acid template, luciferase and sulfurylase may be delivered each on separate 
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beads, or together on the same bead. To allow sequencing DNA at raised temperatures, 
we have cloned and modified the thermostable sulfurylase from Bacillus 
steareothermophilus. We have also cloned and modified several luciferase enzymes for 
solid-phase enzyme activity, including P. pennsylvanica and P. pyralis. The P. pyralis 
5 luciferase is used in a preferred embodiment. 

"Blank" beads may be used that have surface chemistries that facilitate the 
attachment of the desired functionality by the user. Additional examples of these surface 
chemistries for blank beads include, but are not limited to, amino groups including 
aliphatic and aromatic amines, carboxylic acids, aldehydes, amides, chloromethyl groups, 

10 hydrazide, hydroxyl groups, sulfonates and sulfates. 

These functional groups can be used to add any number of different candidate 
agents to the beads, generally using known chemistries. For example, candidate agents 
containing carbohydrates may be attached to an amino-functionalized support; the 
aldehyde of the carbohydrate is made using standard techniques, and then the aldehyde is 

15 reacted with an amino group on the surface. In an alternative embodiment, a sulfhydryl 
linker may be used. There are a number of sulfhydryl reactive linkers known in the art 
such as SPDP, maleimides, ct-haloacetyls, and pyridyl disulfides (see for example the 
1994 Pierce Chemical Company catalog, technical section on cross-linkers, pages 155- 
200, incorporated here by reference) which can be used to attach cysteine containing 

20 proteinaceous agents to the support. Alternatively, an amino group on the candidate 

agent may be used for attachment to an amino group on the surface. For example, a large 
number of stable Afunctional groups are well known in the art, including 
homobifunctional and heterobifunctional linkers (see Pierce Catalog and Handbook, 
pages 155-200). In an additional embodiment, carboxyl groups (either from the surface 

25 or from the candidate agent) may be derivatized using well known linkers (see Pierce 
catalog). For example, carbodiimides activate carboxyl groups for attack by good 
nucleophiles such as amines (see Torchilin et aL, Critical Rev. Thereapeutic Drug 
Carrier Systems, 7(4):275-308 (1991)). Proteinaceous candidate agents may also be 
attached using other techniques known in the art, for example for the attachment of 

30 antibodies to polymers; see Slinkin et ah, Bioconj. Chem. 2:342-348 (1 991 ); Torchilin et 
al., supra; Trubetskoy et al., Bioconj. Chem. 3:323-327 (1992); King et al., Cancer Res. 
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54:6176-6185 (1994); and Wilbur et al., Bioconjugate Client. 5:220-235 (1994). It should 
be understood that the candidate agents may be attached in a variety of ways, including 
those listed above. Preferably, the mariner of attachment does not significantly alter the 
functionality of the candidate agent; that is, the candidate agent should be attached in 

5 such a flexible manner as to allow its interaction with a target. 

Specific techniques for immobilizing enzymes on beads arc known in the prior 
art. In one case, NH 2 surface chemistry beads are used. Surface activation is achieved 
with a 2.5% glutaraldehyde in phosphate buffered saline (10 mM) providing a pH of 6.9 
(138 rnM NaCl, 2.7 mM KC1). This mixture is stirred on a stir bed for approximately 2 

10 hours at room temperature. The beads are then rinsed with ultrapure water plus 0.01% 
Tween 20 (surfactant) -0.02%, and rinsed again with a pH 7.7 PBS plus 0.01% tween 20. 
Finally, the enzyme is added to the solution, preferably after being prefiltered using a 
0.45 nm amicon micropure filter. 

The population of mobile solid supports are disposed in the reaction chambers. In 

15 some embodiments, 5% to 20% of the reaction chambers can have a mobile solid support 
with at least one reagent immobilized thereon, 20% to 60% of the reaction chambers can 
have a mobile solid support with at least one reagent immobilized thereon or 50% to 
100% of the reaction chambers can have a mobile solid support with at least one reagent 
immobilized thereon. Preferably, at least one reaction chamber has a mobile solid 

20 support having at least one reagent immobilized thereon and the reagent is suitable for 
use in a nucleic acid sequencing reaction. 

In some embodiments, the reagent immobilized to the mobile solid support can be 
a polypeptide with sulfurylase activity, a polypeptide with luciferase activity or a 
chimeric polypeptide having both sulfurylase and luciferase activity. In one embodiment, 

25 it can be a ATP sulfurylase and luciferase fusion protein. Since the product of the 
sulfurylase reaction is consumed by luciferase, proximity between these two enzymes 
maybe achieved by covalently linking the two enzymes in the form of a fusion protein. 
This invention would be useful not only in substrate channeling but also in reducing 
production costs and potentially doubling the number of binding sites on streptavidin- 

30 coated beads. 



55 



Express Mail: EV 399526138 US 21465-510 UTIL 

MaUed January 28, 2004 

In another embodiment, the sulfurylase is a thermostable ATP sulfiirylase. In a 
preferred embodiment, the thermostable sulfurylase is active at temperatures above 
ambient (to at least 50°C). In one embodiment, the ATP sulfurylase is from a 
thermophile. In an additional embodiment, the mobile solid support can have a first 
5 reagent and a second reagent immobilized thereon, the first reagent is a polypeptide with 
sulfurylase activity and the second reagent is a polypeptide with luciferase activity. 

In another embodiment, the reagent immobilized to the mobile solid support can 
be a nucleic acid; preferably the nucleic acid is a single stranded concatamer. In a 
preferred embodiment, the nucleic acid can be used for sequencing a nucleic acid, e.g.. a 
1 0 pyrosequencing reaction. 

The invention also provides a method for detecting or quantifying ATP activity 
using a mobile solid support; preferably the ATP can be detected or quantified as part of 
a nucleic acid sequencing reaction. 

A picotiter plate that has been "carpeted" with mobile solid supports with either 
1 5 nucleic acid or reagent enzymes attached thereto is shown as Figure 1 5 . 

5. METHODS OF SEQUENCING NUCLEIC ACIDS 

Pyrophosphate-based sequencing is then performed. The sample DNA sequence 
and the extension primer are then subjected to a polymerase reaction in the presence of a 

20 nucleotide triphosphate whereby the nucleotide triphosphate will only become 

incorporated and release pyrophosphate (PPi) if it is complementary to the base in the 
target position, the nucleotide triphosphate being added either to separate aliquots of 
sample-primer mixture or successively to the same sample-primer mixture. The release 
of PPi is then detected to indicate which nucleotide is incorporated. 

25 In one embodiment, a region of the sequence product is determined by annealing 

a sequencing primer to a region of the template nucleic acid, and then contacting the 
sequencing primer with a DNA polymerase and a known nucleotide triphosphate, i.e., 
dATP, dCTP, dGTP, dTTP, or an analog of one of these nucleotides. The sequence can 
be determined by detecting a sequence reaction byproduct, as is described below. 

30 The sequence primer can be any length or base composition, as long as it is 

capable of specifically annealing to a region of the amplified nucleic acid template. No 
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particular structure for the sequencing primer is required so long as it is able to 
specifically prime a region on the amplified template nucleic acid. Preferably, the 
sequencing primer is complementary to a region of the template that is between the 
sequence to be characterized and the sequence hybridizable to the anchor primer. The 
5 sequencing primer is extended with the DNA polymerase to form a sequence product. 
The extension is performed in the presence of one or more types of nucleotide 
triphosphates, and if desired, auxiliary binding proteins. 

Incorporation of the dNTP is preferably determined by assaying for the presence 
of a sequencing byproduct. In a preferred embodiment, the nucleotide sequence of the 

1 0 sequencing product is determined by measuring inorganic pyrophosphate (PPi) liberated 
from a nucleotide triphosphate (dNTP) as the dNMP is incorporated into an extended 
sequence primer. This method of sequencing, termed Pyrosequencing™ technology 
(PyroSequencing AB, Stockholm, Sweden) can be performed in solution (liquid phase) or 
as a solid phase technique. PPi-based sequencing methods are described generally in, 

15 e.g., W09813523A1, Ronaghi, et aL, 1996. Anal. Biochem. 242: 84-89, Ronaghi, et al. y 
1998. Science 281 : 363-365 (1998) and USSN 2001/0024790. These disclosures of PPi 
sequencing are incorporated herein in their entirety, by reference. See also , e.g., US 
patents 6,210,891 and 6,258,568, each fully incorporated herein by reference. 

Pyrophosphate released under these conditions can be detected enzymatically 

20 {e.g., by the generation of light in the luciferase-luciferin reaction). Such methods enable 
a nucleotide to be identified in a given target position, and the DNA to be sequenced 
simply and rapidly while avoiding the need for electrophoresis and the use of potentially 
dangerous radiolabels. 

PPi can be detected by a number of different methodologies, and various 

25 enzymatic methods have been previously described (see e.g., Reeves, et aL, 1969. Anal. 
Biochem. 28: 282-287; Guillory, et aL, 1971. Anal. Biochem. 39: 170-180; Johnson, et 
a/., 1968. Anal. Biochem. 15: 273; Cook ; et aL. 1978. Anal Biochenu 91: 557-565; and 
Drake, etal., 1979. Anal. Biochem. 94: 117-120). 

PPi liberated as a result of incorporation of a dNTP by a polymerase can be 

30 converted to ATP using, e.g.. an ATP sulfurylase. This enzyme has been identified as 
being involved in sulfur metabolism. Sulfur, in both reduced and oxidized forms, is an 
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essential mineral nutrient for plant and animal growth (see e.g., Schmidt and Jager, 1992. 
Ann. Rev. Plant Physiol. Plant Mol. Biol 43: 325-349). In both plants and 
microorganisms, active uptake of sulfate is followed by reduction to sulfide. As sulfate 
has a very low oxidation/reduction potential relative to available cellular rcductants. the 
5 primary step in assimilation requires its activation via an ATP-dependent reaction (see 
e.g.. Leyh, 1993. Crit. Rev. Biochem. Mol Biol 28: 515-542). ATP sulfurylase (ATP: 
sulfate adenylyltransferase; EC 2.7.7.4) catalyzes the initial reaction in the metabolism of 
inorganic sulfate (SCV 2 ); see e.g., Robbins and Lipmann, 1958. /. Biol Chem. 233: 686- 
690; Hawes and Nicholas, 1973. Biochem. J. 133: 541-550). In this reaction S0 4 * 2 is 

10 activated to adenosine 5'-phosphosulfate (APS). 

ATP sulfurylase has been highly purified from several sources, such as 
Saccharomyces cerevisiae (see e.g., Hawes and Nicholas, 1973. Biochem, J. 133: 541- 
550); Penicillium chrysogenum (see e.g., Renosto, et al, 1990. J. Biol Chem. 265: 
10300-10308); rat liver (see e.g., Yu, et al, 1989. Arch. Biochem. Biophys. 269: 165- 

15 174); and plants (see e.g.. Shaw and Anderson, 1972. Biochem. J. 127: 237-247; 

Osslund, et al, 1982. Plant Physiol. 70: 39-45). Furthermore, ATP sulfurylase genes 
have been cloned from prokaryotes (see e.g.. Leyh, et aL 1992. J. Biol Chem. 267: 
1 0405- 1 04 1 0; Schwedock and Long, 1 989. Mol Plant Microbe Interaction 2: 1 8 1 - 1 94; 
Laue and Nelson, 1994. J. Bacteriol. 176: 3723-3729); eukaryotes (see e.g., Cherest, et 

20 al, 1987. Mol Gen. Genet. 210: 307-313; Mountain and Korch, 1991. Yeast 7: 873-880; 
Foster, et al, 1994. J. Biol Chem. 269: 19777-19786); plants (see e.g., Leustek, et al, 
1994. Plant Physiol 105: 897-90216); and animals (see e.g.. Li, et al, 1995. J. Biol 
Chem. 270: 29453-29459). The enzyme is a homo-oligomer or heterodimer, depending 
upon the specific source (see e.g., Leyh and Suo, 1992. J. Biol Chem. 267: 542-545). 

25 In some embodiments, a thermostable sulfurylase is used. Thermostable 

sulfurylases can be obtained from, e.g., Archaeoglobus or Pyrococcus spp. Sequences of 
thermostable sulfurylases are available at database Acc. No. 028606, Acc. No. Q9YCR4, 
and Acc. No. P56863. 

ATP sulfurylase has been used for many different applications, for example, 

30 bioluminomelric detection of ADP at high concentrations of ATP (see e.g.. Schultz, et 
al, 1993. Anal Biochem. 215: 302-304); continuous monitoring of DNA polymerase 
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activity (see e.g., Nyrbn, 1987. Anal Biochem. 167: 235-238); and DNA sequencing (see 
e.g., Ronaghi, et al, 1996. Anal. Biochem. 242: 84-89; Ronaghi, et ai, 1998. Science 
281: 363-365; Ronaghi, et al, 1998. Anal. Biochem. 267: 65-71). 

Several assays have been developed for detection of the forward ATP sulfurylase 

5 reaction. The colorimetric molybdolysis assay is based on phosphate detection (see e.g., 
Wilson and Bandurski, 1958. J. Biol. Chem. 233: 975-981), whereas the continuous 
spectrophotometric molybdolysis assay is based upon the detection of NADH oxidation 
(see e.g., Seubert, et al, 1983. Arch. Biochem. Biophys. 225: 679-691; Seubert, et al, 
1985. Arch. Biochem. Biophys. 240: 509-523). The later assay requires the presence of 

10 several detection enzymes. In addition, several radioactive assays have also been 

described in the literature (see e.g.. Daley, et al, 1986. Anal Biochem. 157: 385-395). 
For example, one assay is based upon the detection of 32 PPi released from 32 P-labeled 
ATP (see e.g., Seubert, et al, 1985. Arch. Biochem. Biophys. 240: 509-523) and another 
on the incorporation of 35 S into [ 35 S]-labeled APS (this assay also requires purified APS 

15 kinase as a coupling enzyme; see e.g., Seubert, et al, 1 983. Arch. Biochem. Biophys. 225: 
679-691); and a third reaction depends upon the release of 35 S0 4 2 from [ 35 S]-labeled 
APS (see e.g., Daley, et al, 1986. Anal Biochem. 157: 385-395). 

For detection of the reversed ATP sulfiirylase reaction a continuous 
spectrophotometric assay (see e.g., Segc\,etaL 1987 Methods Enzymol 143: 334-349); 

20 a bioluminometric assay (see e.g., Balharry and Nicholas, 1 971. Anal Biochem. 40: 1- 
17); an 35 S0 4 ~ 2 release assay (see e.g.. Seubert, et al, 1985. Arch. Biochem. Biophys. 240: 
509-523); and a 32 PPi incorporation assay (see e.g., Osslund, et al, 1982. Plant Physiol 
70: 39-45) have been previously described. 

ATP produced by an ATP sulfurylase can be hydrolyzed using enzymatic 

25 reactions to generate light. Light-emitting chemical reactions (i.e., chemiluminescence) 
and biological reactions (i.e., bioluminescence) are widely used in analytical 
biochemistry for sensitive measurements of various metabolites. In bio luminescent 
reactions, the chemical reaction that leads to the emission of light is enzyme-catalyzed. 
For example, the luciferin-luciferase system allows for specific assay of ATP and the 

30 bacterial luciferase-oxidoreductase system can be used for monitoring of NAD(P)H. 
Both systems have been extended to the analysis of numerous substances by means of 
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coupled reactions involving the production or utilization of ATP or N AD(P)H (see e.g., 
Kricka, 1991. Chemiluminescent and bioluminescent techniques. Clin. C/wm. 37: 1472- 
1281). 

The development of new reagents have made it possible to obtain stable light 

5 emission proportional to the concentrations of ATP (see e.g., Lundin, 1 982. Applications 
of firefly luciferase In; Luminescent Assays (Raven Press, New York) or NAD(P)H (see 
e.g., Lovgren, et aL, Continuous monitoring of NADH-converting reactions by bacterial 
luminescence. J. Appl. Biochem. 4: 103-1 11). With such stable light emission reagents, 
it is possible to make endpoint assays and to calibrate each individual assay by addition 

10 of a known amount of ATP or NAD(P)H. In addition, a stable light-emitting system also 
allows continuous monitoring of ATP- or NAD(P)H-converting systems. 

Suitable enzymes for converting ATP into light include luciferases, e.g., insect 
luciferases. Luciferases produce light as an end-product of catalysis. The best known 
light-emitting enzyme is that of the firefly, Photinus pyralis (Coleoptera). The 

1 5 corresponding gene has been cloned and expressed in bacteria (see e.g., dc Wet, et aL, 
1985. Proc. Natl Acad Sci. USA 80: 7870-7873) and plants (sec e.g., Ow, et aL, 1986. 
Science 234: 856-859), as well as in insect (see e.g., Jha, et aL, 1990. FEBS Lett. 274: 
24-26) and mammalian cells (see e.g., de Wet, et aL, 1987. Mol. Cell. Biol 7: 725-7373; 
Keller, et aL, 1987. Proc. Natl. Acad. Sci. USA 82: 3264-3268). In addition, a number of 

20 luciferase genes from the Jamaican click beetle, Pyroplorus plagiophihalamus 

{Coleoptera), have recently been cloned and partially characterized (see e.g.. Wood, et 
aL, 1 989. Biolumin. Chemilumin. 4: 289-301; Wood, et aL, 1989. Science 244: 700- 
702). Distinct luciferases can sometimes produce light of different wavelengths, which 
may enable simultaneous monitoring of light emissions at different wavelengths. 

25 Accordingly, these aforementioned characteristics are unique, and add new dimensions 
with respect to the utilization of current reporter systems. 

Firefly luciferase catalyzes bioluminescence in the presence of luciferin, 
adenosine 5'-triphosphate (ATP), magnesium ions, and oxygen, resulting in a quantum 
yield of 0.88 (see e.g.. McElroy and Selinger, 1960. Arch. Biochem. Biophys. 88: 136- 

30 145). The firefly luciferase bioluminescent reaction can be utilized as an assay for the 
detection of ATP with a detection limit of approximately lxlO" 13 M (see e.g., Leach, 
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198 1 . ./. Appi Biochem. 3: 473-5 1 7). In addition, the overall degree of sensitivity and 
convenience of the luciferase-mediated detection systems have created considerable 
interest in the development of firefly luciferase-based biosensors (see e.g., Green and 
Kricka, 1984. Talanta 31: 173-176; Blum, etal. t 1989. J. Biolumin. Chemihwmu 4: 
5 543-550). 

Using the above-described enzymes, the sequence primer is exposed to a 
polymerase and a known dNTP. If the dNTP is incorporated onto the 3' end of the 
primer sequence, the dNTP is cleaved and a PPi molecule is liberated. The PPi is then 
converted to ATP with ATP sulfurylase. Preferably, the ATP sulfurylase is present at a 

10 sufficiently high concentration that the conversion of PPi proceeds with first-order 
kinetics with respect to PPi. In the presence of luciferase, the ATP is hydrolyzed to 
generate a photon. The reaction preferably has a sufficient concentration of luciferase 
present within the reaction mixture such that the reaction, ATP ADP + P0 4 3 " + photon 
(light), proceeds with first-order kinetics with respect to ATP. The photon can be 

15 measured using methods and apparatuses described below. In one embodiment, the PPi 
and a coupled sulfurylase/luciferase reaction is used to generate light for detection. In 
some embodiments, either or both the sulfurylase and luciferase are immobilized on one 
or more mobile solid supports disposed at each reaction site. 

The present invention thus permits PPi release to be detected during the 

20 polymerase reaction giving a real-time signal. The sequencing reactions may be 

continuously monitored in real-time. A procedure for rapid detection of PPi release is 
thus enabled by the present invention. The reactions have been estimated to take place in 
less than 2 seconds (Nyren and Lundin, supra). The rate limiting step is the conversion 
of PPi to ATP by ATP sulfurylase, while the luciferase reaction is fast and has been 

25 estimated to take less than 0.2 seconds. Incorporation rates for polymerases have also 
been estimated by various methods and it has been found, for example, that in the case of 
Klenow polymerase, complete incorporation of one base may take less than 0.5 seconds. 
Thus, the estimated total time for incorporation of one base and detection by this 
enzymatic assay is approximately 3 seconds. It will be seen therefore that very fast 

30 reaction times are possible, enabling real-time detection. The reaction times could further 
be decreased by using a more diermostable luciferase. 



61 



Express Mail: EV 399526138 US 21465-510 UTIL 

Mailed January 28, 2004 

For most applications it is desirable to use reagents free of contaminants like ATP 
and PPi. These contaminants may be removed by flowing the reagents through a pre- 
column containing apyrase and/-or pyrophosphatase bound to resin. Alternatively, the 
apyrase or pyrophosphatase can be bound to magnetic beads and used to remove 

5 contaminating ATP and PPi present in the reagents. In addition it is desirable to wash 
away diffusible sequencing reagents, e.g., unincorporated dNTPs, with a wash buffer. 
Any wash buffer used in pyrophosphate sequencing can be used. 

In some embodiments, the concentration of reactanls in the sequencing reaction 
include 1 pmol DNA, 3 pmol polymerase, 40 pmol dNTP in 0.2 ml buffer. See Ronaghi, 

10 etaL Anal Biochem. 242: 84-89 (1996). 

The sequencing reaction can be performed with each of four predetermined 
nucleotides, if desired. A "complete" cycle generally includes sequentially administering 
sequencing reagents for each of the nucleotides dATP, dGTP, dCTP and dTTP (or 
dUTP), in a predetermined order. Unincorporated dNTPs are washed away between each 

15 of the nucleotide additions. Alternatively, unincorporated dNTPs are degraded by 
apyrase (see below). The cycle is repeated as desired until the desired amount of 
sequence of the sequence product is obtained. In some embodiments, about 10-1000, 10- 
100, 10-75, 20-50, or about 30 nucleotides of sequence information is obtained from 
extension of one annealed sequencing primer. 

20 In some embodiments, the nucleotide is modified to contain a disulfide-derivative 

of a hapten such as biotin. The addition of the modified nucleotide to the nascent primer 
annealed to the anchored substrate is analyzed by a post-polynierization step that includes 
i) sequentially binding of, in the example where the modification is a biotin, an avidin- or 
streptavidin-conjugated moiety linked to an enzyme molecule, ii) the washing away of 

25 excess avidin- or streptavidin-linked enzyme, iii) the flow of a suitable enzyme substrate 
under conditions amenable to enzyme activity, and iv) the detection of enzyme substrate 
reaction product or products. The hapten is removed in this embodiment through the 
addition of a reducing agent. Such methods enable a nucleotide to be identified in a 
given target position, and the DNA to be sequenced simply and rapidly while avoiding 

30 the need for electrophoresis and the use of potentially dangerous radiolabels. 
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A preferred enzyme for detecting the hapten is horse-radish peroxidase. If 
desired, a wash buffer, can be used between the addition of various reactanls herein. 
Apyrase can be used to remove unreacted dNTP used to extend the sequencing primer. 
The wash buffer can optionally include apyrase. 

5 Example haptens, e.g., biotin, digoxygenin, the fluorescent dye molecules cy3 and 

cy5, and fluorescein, are incorporated at various efficiencies into extended DNA 
molecules. The attachment of the hapten can occur through linkages via the sugar, the 
base, and via the phosphate moiety on the nucleotide. Example means for signal 
amplification include fluorescent, electrochemical and enzymatic. In a preferred 

10 embodiment using enzymatic amplification, the enzyme, e.g. alkaline phosphatase (AP), 
horse-radish peroxidase (HRP), beta-galactosidase, luciferase, can include those for 
which light-generating substrates are known, and the means for detection of these light- 
generating (chemiluminescent) substrates can include a CCD camera. 

In a preferred mode, the modified base is added, detection occurs, and the hapten- 

1 5 conjugated moiety is removed or inactivated by use of either a cleaving or inactivating 
agent. For example, if the cleavable-linker is a disulfide, then the cleaving agent can be a 
reducing agent, for example dithiothreitol (DTT), beta-mercaptoethanol, etc. Other 
embodiments of inactivation include heat, cold, chemical denaturants, surfactants, 
hydrophobic reagents, and suicide inhibitors. 

20 Luciferase can hydrolyze dATP directly with concomitant release of a photon. 

This results in a false positive signal because the hydrolysis occurs independent of 
incorporation of the dATP into the extended sequencing primer. To avoid this problem, a 
dATP analog can be used which is incorporated into DNA, i.e., it is a substrate for a 
DNA polymerase, but is not a substrate for luciferase. One such analog is a-thio-dATP. 

25 Thus, use of a-thio-dATP avoids the spurious photon generation that can occur when 
dATP is hydrolyzed without being incorporated into a growing nucleic acid chain. 

Typically, the PPi-based detection is calibrated by the measurement of the light 
released following the addition of control nucleotides to the sequencing reaction mixture 
immediately after the addition of the sequencing primer. This allows for normalization of 

30 the reaction conditions. Incorporation of two or more identical nucleotides in succession 
is revealed by a corresponding increase in the amount of light released. Thus, a two-fold 
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increase in released light relative to control nucleotides reveals the incorporation of two 
successive dNTPs into the extended primer. 

If desired, apyrase may be "washed" or ''flowed" over the surface of the solid 
support so as to facilitate the degradation of any remaining, non-incorporated dNTPs 
5 within the sequencing reaction mixture. Apyrase also degrades the generated ATP and 
hence '"turns off" the light generated from the reaction. Upon treatment with apyrase, any 
remaining reactanls are washed away in preparation for the following dNTP incubation 
and photon detection steps. Alternatively, the apyrase may be bound to the solid or 
mobile solid support. 

10 

Double Ended Sequencing 

In a preferred embodiment we provide a method for sequencing from both ends of 
a nucleic acid template. Traditionally, the sequencing of two ends of a double stranded 
DNA molecule would require at the very least the hybridization of primer, sequencing of 

1 5 one end, hybridization of a second primer, and sequencing of the other end. The 

alternative method is to separate the individual strands of the double stranded nucleic 
acid and individually sequence each strand. The present invention provides a third 
alternative that is more rapid and less labor intensive than the first two methods. 

The present invention provides for a method of sequential sequencing of nucleic 

20 acids from multiple primers. References to DNA sequencing in this application are 
directed to sequencing using a polymerase wherein the sequence is determined as the 
nucleotide triphosphate (NTP) is incorporated into the growing chain of a sequencing 
primer. One example of this type of sequencing is the pyro-sequencing detection 
pyrophosphate method (see, e.g., U.S. Patents 6,274,320, 6258,568 and 6,210,891, each 

25 of which is incorporated in total herein by reference.). 

In one embodiment, the present invention provides for a method for sequencing 
two ends of a template double stranded nucleic acid. The double stranded DNA is 
comprised of two single stranded DNA; referred to herein as a first single stranded DNA 
and a second single stranded DNA. A first primer is hybridized to the first single 

30 stranded DNA and a second primer is hybridized to the second single stranded DNA. 
The first primer is unprotected while the second primer is protected. 'Protection" and 
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"protected" are defined in this disclosure as being the addition of a chemical group to 
reactive sites on the primer that prevents a primer from polymerization by DNA 
polymerase. Further, the addition of such chemical protecting groups should be 
reversible so that after reversion, the now deprotected primer is once again able to serve 
5 as a sequencing primer. The nucleic acid sequence is determined in one direction (e.g., 
from one end of the template) by elongating the first primer with DNA polymerase using 
conventional methods such as pyrophosphate sequencing. The second primer is then 
deprotected, and the sequence is determined by elongating the second primer in the other 
direction {e.g., from the other end of the template) using DNA polymerase and 

10 conventional methods such as pyrophosphate sequencing. The sequences of the first and 
second primers are specifically designed to hybridize to the two ends of the double 
stranded DNA or at any location along the template in this method. 

In another embodiment, the present invention provides for a method of 
sequencing a nucleic acid from multiple primers. In this method a number of sequencing 

15 primers are hybridized to the template nucleic acid to be sequenced. All the sequencing 
primers are reversibly protected except for one. A protected primer is an oligonucleotide 
primer that cannot be extended with polymerase and dNTPs which are commonly used in 
DNA sequencing reactions. A reversibly protected primer is a protected primer which 
can be deprotected. All protected primers referred to in this invention are reversibly 

20 protected. After deprotection, a reversibly protected primer functions as a normal 
sequencing primer and is capable of participating in a normal sequencing reaction. 

The present invention provides for a method of sequential sequencing a nucleic 
acid from multiple primers. The method comprises the following steps: First, one or 
more template nucleic acids to be sequenced are provided. Second, a plurality of 

25 sequencing primers arc hybridized to the template nucleic acid or acids. The number of 
sequencing primers may be represented by the number n where n can be any positive 
number greater than 1 . That number may be, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 
greater. Of the primers, n-1 number may be protected by a protection group. So, for 
example, if n is 2, 3, 4, 5, 6, 7, 8, 9 or 10, n-1 would be 1, 2, 3, 4, 5, 6, 7, 8, 9 

30 respectively. The remaining primer (e.g., n number primers - (n-1) number of protected 
primers = one remaining primer) is unprotected. Third, the unprotected primer is 
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extended and the template DNA sequence is determined by conventional methods such 
as, for example, pyrophosphate sequencing. Fourth, after the sequencing of the first 
primer, one of the remaining protected primers is unprotected. Fifth, unprotected primer 
is extended and the template DNA sequence is determined by conventional methods such 
5 as, for example, pyrophosphate sequencing. Optionally, the method may be repeated 
until sequencing is performed on all the protected primers. 

In another aspect, the present invention includes a method of sequential 
sequencing of a nucleic acid comprising the steps of: (a) hybridizing 2 or more 
sequencing primers to the nucleic acid wherein all the primers except for one are 

10 reversibly protected; (b) determining a sequence of one strand of the nucleic acid by 
polymerase elongation from the unprotected primer; (c) deprotecting one of the 
reversibly protected primers into an unprotected primer; (d) repeating steps (b) and (c) 
until all the reversibly protected primers are deprotected and used for determining a 
sequence. In one embodiment, this method comprises one additional step between steps 

1 5 (b) and (c), i.e., the step of terminating the elongation of the unprotected primer by 

contacting the unprotected primer with DNA polymerase and one or more of a nucleotide 
triphosphate or a dideoxy nucleotide triphosphate. In yet another embodiment, this 
method further comprises an additional step between said step (b) and (c), i.e., 
terminating the elongation of the unprotected primer by contacting the unprotected 

20 primer with DNA polymerase and a dideoxy nucleotide triphosphate from ddATP, 
ddTTP, ddCTP, ddGTP or a combination thereof. 

In another aspect, this invention includes a method of sequencing a nucleic acid 
comprising: (a) hybridizing a first unprotected primer to a first strand of the nucleic acid; 
(b) hybridizing a second protected primer to a second strand; (c) exposing the first and 

25 second strands to polymerase, such that the first unprotected primer is extended along the 
first strand; (d) completing the extension of the first sequencing primer; (e) deprotecting 
the second sequencing primer, and (0 exposing the first and second strands to 
polymerase so that the second sequencing primer is extended along the second strand. In 
a preferred embodiment, completing comprises capping or terminating the elongation. 

30 In another embodiment, the present invention provides for a method for 

sequencing two ends of a template double stranded nucleic acid that comprises a first and 
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a second single stranded DNA. In this embodiment, a first primer is hybridized to the 
first single stranded DNA and a second primer is hybridized to the second single stranded 
DNA in the same step. The first primer is unprotected while the second primer is 
protected. 

5 Following hybridization, the nucleic acid sequence is determined in one direction 

(e.g., from one end of the template) by elongating the first primer with DNA polymerase 
using conventional methods such as pyrophosphate sequencing. In a preferred 
embodiment, the polymerase is devoid of 3' to 5* exonuclease activity. The second 
primer is then deprotected, and its sequence is determined by elongating the second 

10 primer in the other direction (e.g., from the other end of the template) with DNA 
polymerase using conventional methods such as pyrophosphate sequencing. As 
described earlier, the sequences of the first primer and the second primer are designed to 
hybridize to the two ends of the double stranded DNA or at any location along the 
template. This technique is especially useful for the sequencing of many template DNAs 

15 that contain unique sequencing primer hybridization sites on its two ends. For example, 
many cloning vectors provide unique sequencing primer hybridization sites flanking the 
insert site to facilitate subsequent sequencing of any cloned sequence (e.g., Bluescript, 
Stratagene, La Jolla, CA). 

One benefit of this method of the present invention is that both primers may be 

20 hybridized in a single step. The benefits of this and other methods are especially useful 
in parallel sequencing systems where hybridizations are more involved than normal. 
Examples of parallel sequencing systems are disclosed in copending U.S. patent 
application serial number 10/104,280, the disclosure of which is incorporated in total 
herein. 

25 The oligonucleotide primers of the present invention may be synthesized by 

conventional technology, e.g., with a commercial oligonucleotide synthesizer and/or by 

ligating together subfragments that have been so synthesized. 

In another embodiment of the invention, the length of the double stranded target 

nucleic acid may be determined. Methods of determining the length of a double stranded 
30 nucleic acid are known in the art. The length determination may be performed before or 

after the nucleic acid is sequenced. Known methods of nucleic acid molecule length 
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determination include gel electrophoresis, pulsed field gel electrophoresis, mass 
spectroscopy and the like. Since a blunt ended double stranded nucleic acid is comprised 
of two single strands of identical lengths, the determination of the length of one strand of 
a nucleic acid is sufficient to determine the length of the corresponding double strand. 

5 The sequence reaction according to the present invention also allows a 

determination of the template nucleic acid length. First, a complete sequence from one 
end of the nucleic acid to another end will allow the length to be determined. Second, the 
sequence determination of the two ends may overlap in the middle allowing the two 
sequences to be linked. The complete sequence may be determined and the length may 

10 be revealed. For example, if the template is 100 bps long, sequencing from one end may 
determine bases 1 to 75; sequencing from the other end may determine bases 25 to 100; 
there is thus a 5 1 base overlap in the middle from base 25 to base 75; and from this 
information, the complete sequence from 1 to 100 may be determined and the length, of 
100 bases, may be revealed by the complete sequence. 

15 Another method of the present invention is directed to a method comprising the 

following steps. First a plurality of sequencing primers, each with a different sequence, 
is hybridized to a DNA to be sequenced. The number of sequencing primers may be any 
value greater than one such as, for example, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more. All of these 
primers are reversibly protected except for one. The one unprotected primer is elongated 

20 in a sequencing reaction and a sequence is determined. Usually, when a primer is 

completely elongated, it cannot extend and will not affect subsequent sequencing from 
another primer. If desired, the sequenced primer may be terminated using excess 
polymerase and dNTP or using ddNTPs. If a termination step is taken, the termination 
reagents (dNTPs and ddNTPs) should be removed after the step. Then, one of the 

25 reversibly protected primers is unprotected and sequencing from the second primer 
proceeds. The steps of deprotecting a primer, sequencing from the deprotecled primer, 
and optionally, terminating sequencing from the primer is repeated until all the protected 
primers are unprotected and used in sequencing. 

The reversibly protected primers should be protected with different chemical 

30 groups. By choosing the appropriate method of deprotection, one primer may be 

deprotected without affecting the protection groups of the other primers. In a preferred 
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embodiment, the protection group is P0 4 . That is, the second primer is protected by PO4 
and deprotection is accomplished by T4 polynucleotide kinase (utilizing its 3*- 
phosphatase activity). In another preferred embodiment, the protection is a thio group or 
a phosphorothiol group. 
5 The template nucleic acid may be a DN A, RNA, or peptide nucleic acid (PNA). 

While DNA is the preferred template, RNA and PNA may be converted to DNA by 
known techniques such as random primed PCR, reverse transcription, RT-PCR or a 
combination of these techniques. Further, the methods of the invention are useful for 
sequencing nucleic acids of unknown and known sequence. The sequencing of nucleic 

10 acid of known sequence would be useful, for example, for confirming the sequence of 
synthesized DNA or for confirming the identity of suspected pathogen with a known 
nucleic acid sequence. The nucleic acids may be a mixture of more than one population 
of nucleic acids. It is known that a sequencing primer with sufficient specificity (e.g., 20 
bases, 25 bases, 30 bases, 35 bases, 40 bases, 45 bases, or 50 bases) may be used to 

1 5 sequence a subset of sequences in a long nucleic acid or in a population of unrelated 
nucleic acids. Thus, for example, the template may be one sequence of 10 Kb or ten 
sequences of 1 Kb each. In a preferred embodiment, the template DNA is between 50 bp 
to 700 bp in length. The DNA can be single stranded or double stranded. 

In the case where the template nucleic acid is single stranded, a number of 

20 primers may be hybridized to the template nucleic acid as shown below: 

5 , --primer4-3 > S'-primer 3 -3' S'-primer 2 -3 ; 5'-primcr 1-3' 

3' template nucleic acid 5 1 

In this case, it is preferred that the initial unprotected primer would be the primer 
that hybridizes at the most 5' end of the template. See primer 1 in the above illustration. 

25 In this orientation, the elongation of primer 1 would not displace (by strand displacement) 
primer 2, 3, or 4. When sequencing from primer 1 is finished, primer 2 can be 
unprotected and nucleic acid sequencing can commence. The sequencing from primer 2 
may displace primer 1 or the elongated version of primer one but would have no effect on 
the remaining protected primers (primers 3 and 4). Using this order, each primer may be 

30 used sequentially and a sequencing reaction from one primer would not affect the 
sequencing from a subsequent primer. 
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One feature of the invention is the ability to use multiple sequencing primers on 
one or more nucleic acids and the ability to sequence from multiple primers using only 
one hybridization step. In the hybridization step, all the sequencing primers (e.g., the n 
number of sequencing primers) may be hybridized to the template nucleic acid(s) at the 

5 same time. In conventional sequencing, usually one hybridization step is required for 
sequencing from one primer. One feature of the invention is that the sequencing from n 
primers (as defined above) may be performed by a single hybridization step. This 
effectively eliminates n-1 hybridization step. 

In a preferred embodiment, the sequences of the n number of primers are 

10 sufficiently different that the primers do not cross hybridize or self-hybridize. Cross 
hybridization refers to the hybridization of one primer to another primer because of 
sequence complementarity. One form of cross hybridization is commonly referred to as a 
"primer dimer." In the case of a primer dimer, the 3' ends of two primers are 
complementary and form a structure that when elongated, is approximately the sum of the 

15 length of the two primers. Self-hybridization refers to the situation where the 5' end of a 
primer is complementary to the 3' end of the primer. In that case, the primer has a 
tendency to self hybridize to form a hairpin-like structure. 

A primer can interact or become associated specifically with the template 
molecule. By the terms "interact" or "associate", it is meant herein that two substances or 

20 compounds (e.g., primer and template; chemical moiety and nucleotide) are bound (e.g., 
attached, bound, hybridized, joined, annealed, covalently linked, or otherwise associated) 
to one another sufficiently that the intended assay can be conducted. By the terms 
"specific" or "specifically", it is meant herein that two components bind selectively to 
each other. The parameters required to achieve specific interactions can be determined 

25 routinely, e.g., using conventional methods in the art. 

To gain more sensitivity or to help in the analysis of complex mixtures, the 
protected primers can be modified (e.g., derivatized) with chemical moieties designed to 
give clear unique signals. For example, each protected primer can be derivatized with a 
different natural or synthetic amino acid attached through an amide bond to the 

30 oligonucleotide strand at one or more positions along the hybridizing portion of the 
strand. The chemical modification can be detected, of course, either afler having been 
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cleaved from the target nucleic acid, or while in association with the larget nucleic acid. 
By allowing each protected target nucleic acid to be identified in a distinguishable 
manner, it is possible to assay (e.g., to screen) for a large number of different target 
nucleic acids in a single assay. Many such assays can be performed rapidly and easily. 
5 Such an assay or set of assays can be conducted, therefore, with high throughput 
efficiency as defined herein. 

In the methods of the invention, after a first primer is elongated and the sequence 
of the template DNA is determined, a second primer is deprotected and sequenced. There 
is no interference between the sequencing reaction of the first primer with the sequencing 

10 reaction of the second, now unprotected, primer because the first primer is completely 
elongated or terminated. Because the first primer is completely elongated, the 
sequencing from the second primer, using conventional methods such a pyrophosphate 
sequencing, will not be affected by the presence of the elongated first primer. The 
invention also provides a method of reducing any possible signal contamination from the 

15 first primer. Signal contamination refers to the incidences where the first primer is not 
completely elongated. In that case, the first primer will continue to elongate when a 
subsequent primer is deprotected and elongated. The elongation of both the first and 
second primers may interfere with the determination of DNA sequence. 

In a preferred embodiment, the sequencing reaction (e.g., the chain elongation 

20 reaction) from one primer is first terminated or completed before a sequencing reaction is 
started on a second primer. A chain elongation reaction of DNA can be terminated by 
contacting the template DNA with DNA polymerase and dideoxy nucleotide 
triphosphates (ddNTPs) such as ddATP, ddTTP, ddGTP and ddCTP. Following 
termination, the dideoxy nucleotide triphosphates may be removed by washing the 

25 reaction with a solution without ddNTPs. A second method of preventing further 

elongation of a primer is to add nucleotide triphosphates (dNTPs such as dATP, dTTP, 
dGTP and dCTP) and DNA polymerase to a reaction to completely extend any primer 
that is not completely extended. Following complete extension, the dNTPs and the 
polymerases are removed before the next primer is deprotected. By completing or 

30 terminating one primer before deprotecting another primer, the signal to noise ratio of the 
sequencing reaction (e.g., pyrophosphate sequencing) can be improved. 
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The steps of (a) optionally terminating or completing the sequencing, (b) 

deprotecting a new primer, and (c) sequencing from the deprotected primer may be 

repeated until a sequence is determined from the elongation of each primer. In this 

method, the hybridization step comprises "n" number of primers and one unprotected 

5 primer. The unprotected primer is sequenced first and the steps of (a), (b) and (c) above 

may be repeated. 

In a preferred embodiment, pyrophosphate sequencing is used for all sequencing 
conducted in accordance with the method of the present invention. 

In another preferred embodiment, the double ended sequencing is performed 

1 0 according to the process outlined in Figure 1 0. This process may be divided into six 
steps: (1) creation of a capture bead (Figure 10A); (2) drive to bead (DTB) PCR 
amplification (Figure 10B); (3) SL reporter system preparation (Figure IOC); (4) 
sequencing of the first strand (Figure 10D); (5) preparation of the second strand (Figures 
10E and 10F); and (6) analysts of each strand (Figure 10G). This exemplary process is 

15 outlined below. 

In step I, an N-hydroxysuccinimide (NHS)-aclivated capture bead (e.g., 
Amershani Biosciences, Piscataway, NJ) is coupled to both a forward primer and a 
reverse primer. NHS coupling forms a chemically stable amide bond with ligands 
containing primary amino groups. The capture bead is also coupled to biotin (Figure 

20 10A). The beads (i.e., solid nucleic acid capturing supports) used herein may be of any 
convenient size and fabricated from any number of known materials. Example of such 
materials include: inorganics, natural polymers, and synthetic polymers. Specific 
examples of these materials include: cellulose, cellulose derivatives, acrylic resins, glass; 
silica gels, polystyrene, gelatin, polyvinyl pyrrolidone, co-polymers of vinyl and 

25 acrylamide, polystyrene cross-linked with divinylbenzene or the like (see, Merrifleld 
Biochemistry 1964, 3, 1385-1390), polyacrylamides, latex gels, polystyrene, dextran. 
rubber, silicon, plastics, nitrocellulose, celluloses, natural sponges, silica gels, glass, 
metals plastic, cellulose, cross-linked dextrans (e.g., Sephadex™) and agarose gel 
(Sepharose™) and solid phase supports known to those of skill in the art. In a preferred 

30 embodiment, the capture beads are Sepharose beads approximately 25 to 40 \iM in 
diameter. 
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In step 2, template DNA which has hybridized to the forward and reverse primers 

is added, and the DNA is amplified through a PCR amplification strategy (Figure 10B). 

In one embodiment, the DNA is amplified by Emulsion Polymerase Chain Reaction, 

Drive to Bead Polymerase Chain Reaction, Rolling Circle Amplification or Loop- 

5 mediated Isothermal Amplification. In step 3, streptavidin is added followed by the 
addition of sulfurylase and luciferase which are coupled to the streptavidin (Figure 10C). 
The addition of auxiliary enzymes during a sequencing method has been disclosed in 
U.S.S.N. 10/104,280 and U.S.S.N. 10/127,906, which are incorporated herein in their 
entireties by reference. In one embodiment, the template DNA has a DNA adaptor 

10 ligated to both the 5' and 3* end. In a preferred embodiment, the DNA is coupled to the 
DNA capture bead by hybridization of one of the DNA adaptors to a complimentary 
sequence on the DNA capture bead. 

In the first step, single stranded nucleic acid template to be amplified is attached 
to a capture bead. The nucleic acid template may be attached to the capture bead in any 

15 manner known in the art. Numerous methods exist in the art for attaching tine DNA lo a 
microscopic bead. Covalcnt chemical attachment of the DNA to the bead can be 
accomplished by using standard coupling agents, such as water-soluble carbodiimide, to 
link the 5'-phosphate on the DNA to amine-coated microspheres through a 
phosphoamidate bond. Another alternative is to first couple specific oligonucleotide 

20 linkers to the bead using similar chemistry, and to then use DNA ligase to link the DNA 
to the linker on the bead. Other linkage chemistries include the use of N- 
hydroxysuccinamide (NHS) and its derivatives, to join the oligonucleotide to the beads. 
In such a method, one end of the oligonucleotide may contain a reactive group (such as 
an amide group) which forms a covalent bond with the solid support, while the other end 

25 of the linker contains another reactive group which can bond with the oligonucleotide to 
be immobilized. In a preferred embodiment, the oligonucleotide is bound to the DNA 
capture bead by covalent linkage. However, non-covalent linkages, such as chelation or 
antigen-antibody complexes, may be used to join the oligonucleotide to the bead. 

Oligonucleotide linkers can be employed which specifically hybridize to unique 

30 sequences at the end of the DNA fragment, such as the overlapping end from a restriction 
enzyme site or the "sticky ends" of bacteriophage lambda based cloning vectors, but 
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blunt-end ligations can also be used beneficially. These methods are described in detail 
in US 5,674,743, the disclosure of which is incorporated in toto herein. It is preferred 
that any method used to immobilize the beads will continue to bind the immobilized 
oligonucleotide throughout the steps in the methods of the invention. In a preferred 
5 embodiment, the oligonucleotide is bound to the DNA capture bead by covalent linkage. 
However, non-covalent linkages, such as chelation or antigen-antibody complexes, may 
be used to join the oligonucleotide to the bead. 

In step 4, the first strand of DNA is sequenced by depositing the capture beads 
onto a PicoTiter plate (PTP), and sequencing by a method known to one of ordinary skill 

10 in the art (e.g., pyrophosphate sequencing) (Figure 10D). Following sequencing, a 
mixture of dNTPs and ddNTPs are added in order to "cap" or terminate the sequencing 
process (Figure 10E). In step 5, the second strand of nucleic acid is prepared by adding 
apyrase to remove the ddNTPs and polynucleotide kinase (PNK) to remove the 3' 
phosphate group from the blocked primer strand (Figure 10F). Polymerase is then added 

15 to prime the second strand followed by sequencing of the second strand according to a 
standard method known to one of ordinary skill in the art (Figure 10G). In step 7, the 
sequence of the both the first and second strand is analyzed such that a contiguous DNA 
sequence is determined. 

20 Detection means 

The solid support is optically linked to an imaging system 230, which includes a 
CCD system in association with conventional optics or a fiber optic bundle. In one 
embodiment the perfusion chamber substrate includes a fiber optic array wafer such that 
light generated near the aqueous interface is transmitted directly through the optical fibers 

25 to the exterior of the substrate or chamber. When the CCD system includes a fiber optic 
connector, imaging can be accomplished by placing the perfusion chamber substrate in 
direct contact with the connector. Alternatively, conventional optics can be used to 
image the light, e.g., by using a 1-1 magnification high numerical aperture lens system, 
from the exterior of the liber optic substrate directly onto the CCD sensor. When the 

30 substrale does not provide for fiber optic coupling, a lens system can also be used as 
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described above, in which case either the substrate or the perfusion chamber cover is 
optically transparent. An exemplary CCD imaging system is described above. 

The imaging system 230 is used to collect light from the reactors on the substrate 
surface. Light can be imaged, for example, onto a CCD using a high sensitivity low 

5 noise apparatus known in the art. For Tiber-optic based imaging, it is preferable to 

incorporate the optical fibers directly into the cover slip or for a FORA to have the optical 
fibers that form the microwells also be the optical fibers that convey light to the detector. 

The imaging system is linked to a computer control and data collection system 
240. In general, any commonly available hardware and software package can be used. 

1 0 The computer control and data collection system is also linked to the conduit 200 to 
control reagent delivery. 

The photons generated by the pyrophosphate sequencing reaction are captured by 
the CCD only if they pass through a focusing device (e.g., an optical lens or optical fiber) 
and are focused upon a CCD element. However, the emitted photons will escape equally 

1 5 in all directions. In order to maximize their subsequent "capture" and quantitation when 
utilizing a planar array (e.g., a DNA chip), it is preferable to collect the photons as close 
as possible to the point at which they are generated, e.g. immediately at the planar solid 
support. This is accomplished by either: (/) utilizing optical immersion oil between the 
cover slip and a traditional optical lens or optical fiber bundle or, preferably, 

20 (//) incorporating optical fibers directly into the cover slip itself. Similarly, when a thin, 
optically transparent planar surface is used, the optical fiber bundle can also be placed 
against its back surface, eliminating the need to "image" through the depth of the entire 
reaction/perfiision chamber. 

The reaction event, e.g., photons generated by luciferase, may be detected and 

25 quantified using a variety of detection apparatuses, e.g., a photomultiplier tube, a CCD, 
CMOS, absorbance photometer, a luminometer, charge injection device (CID), or other 
solid state detector, as well as the apparatuses described herein. In a preferred 
embodiment, the quantitation of the emitted photons is accomplished by the use of a 
CCD camera fitted with a fused fiber optic bundle. In another preferred embodiment, the 

30 quantitation of the emitted photons is accomplished by the use of a CCD camera fitted 
with a microchanncl plate intensifier. A back-thinned CCD can be used to increase 
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sensitivity. CCD detectors are described in, e.g., Bronks, et ai, 1995. Anal. Client 65: 
2750-2757. 

An exemplary CCD system is a Spectral Instruments, Inc. (Tucson, AZ) Series 
600 4-port camera with a Lockheed-Martin LM485 CCD chip and a 1-1 fiber optic 

5 connector (bundle) with 6-8 |im individual fiber diameters. This system has 4096 x 

4096, or greater than 16 million pixels and has a quantum efficiency ranging from 10% to 
> 40%. Thus, depending on wavelength, as much as 40% of the photons imaged onto the 
CCD sensor are converted to detectable electrons. 

In other embodiments, a fluorescent moiety can be used as a label and the 

10 detection of a reaction event can be carried out using a confocal scanning microscope to 
scan the surface of an array with a laser or other techniques such as scanning near-field 
optical microscopy (SNOM) are available which are capable of smaller optical resolution, 
thereby allowing the use of ' more dense" arrays. For example, using SNOM, individual 
polynucleotides may be distinguished when separated by a distance of less than 100 nm, 

15 e.g.. lOnm x lOnm. Additionally, scanning tunneling microscopy (Binning el al., 

Helvetica Physica Acta, 55:726-735, 1982) and atomic force microscopy (Hanswa et al., 
Annu Rev Biophys Biotnol Struct, 23: 1 1 5-139, 1994) can be used. 

Haplotype Application 

20 Virtually any sequencing application can be accomplished using the methods and 

apparatus of this invention. In one embodiment we contemplate haplotype mapping. 
Human gene diversity is an important factor in the variability of patient response to 
pharmaceuticals. The most precise measurement of this diversity is the haplotype, which 
is the organization of polymorphic variation as it is found on a chromosome. Recently, 

25 major government and academic genome researchers in the US, Canada and Europe have 
agreed that haplotypes arc a powerful tool that can reduce the complexity of genetic 
information to a practical form. Haplotypes can be used in drug discovery to improve the 
outcome of target validation and drug screening studies and in drug development to 
improve the design and reliability of clinical trials. Haplotype markers can be used to 

30 predict the efficacy and safety of new and approved drugs and will serve as the 
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foundation for a new paradigm of personalized medicine matching patients to the right 
drug at the right dose via guidance from a database of clinical marker-associations. 

Numerous empirical studies have shown that nearby SNP alleles are often in 
linkage disequilibrium (LD) with each other, such that the state of one SNP allele is often 
5 highly correlated with the allele of another close SNP. These correlations exist because of 
the shared history of tightly linked SNP's ? which are co-transmitted from generation to 
generation. Patterns of human sequence variation (haplotypes) thus represent ancestral 
DNA segments. Historical meioses have slowly dissociated alleles from neighboring 
alleles on ancestral chromosomes, except for tightly linked variants. The extent of linkage 

1 0 disequilibrium in founder populations with recent bottlenecks had been the object of 
numerous studies - particularly in the cloning of simple Mendelian disorders disorders 
such as cystic fibrosis (16), Huntington's disease (11), diastrophic dysplasia (DTD) (8). 
Whereas these cloning studies benefited from the large chromosomal segments showing 
LD spanning over large distances (often in the megabase range), very little empirical data 

1 5 was available until recently regarding LD across the human genome in the world 
population. 

We focus on three recent examples of large-scale surveys of LD (and haplotypes): 
(see, e.g, Reich, D. E., Cargill, M., Bolk, S., Ireland, J., Sabeti, P. C, Richter, D. J., 
Lavery, T., Kouyoumjian, R., Farhadian, S. R, Ward, R. & Lander, E. S. 2001. Linkage 

20 disequilibrium in the human genome. Nature 411,1 99-204.26). We sampled 1 9 

chromosome regions for their SNP content. High frequency SNP's spanning intervals of 2 
to 160 kb were first geno typed in a Caucasian samples. Over all regions, LD was 
detectable at distances of about 60kb, with a significant difference between regions, as 
the range was as short as 6kb at one locus and as long 155 kb in another. Not 

25 surprisingly, LD was significantly correlated with the estimated local recombination 
rates. Further analysis in a Nigerian sample provided evidence of shorter LD in this 
population - although the allelic combinations over short distances were similar to the 
Caucasian sample. Overall - this work provided evidence that large blocks of LD are 
common across the human genome, and that genome-wide LD mapping of disease genes 

30 will be feasible. 
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Kits 

The invention also comprises kits for use in methods of the invention which could 
include one or more of the following components: (a) a test specific primer which 
hybridizes to sample DNA so that the target position is directly adjacent to the 3' end of 
5 the primer; (b) a polymerase; (c) detection enzyme means for identifying PPi release; 
(d) deoxynucleotides including, in place of dATP, a dATP analogue which is capable of 
acting as a substrate for a polymerase but incapable of acting as a substrate for a said PPi- 
detection enzyme: and (e) optionally dideoxynucleotides, optionally ddATP being 
replaced by a ddATP analogue which is capable of acting as a substrate for a polymerase 

10 but incapable of acting as a substrate for a said PPi-detection enzyme. If the kit is for use 
with initial PCR amplification then it could also include the following components: (i) a 
pair of primers for PCR, at least one primer having means permitting immobilization of 
said primer; (ii) a polymerase which is preferably heat stable, for example Taql 
polymerase; (iii) buffers for the PCR reaction; and (iv) deoxynucleotides. Where an 

15 enzyme label is used to evaluate PCR, the kit will advantageously contain a substrate for 
the enzyme and other components of a detection system. 

One embodiment of the invention is directed to a method for sequencing nucleic 
acids. The method involves fragmenting large template nucleic acid molecules to 
generate a plurality of fragmented nucleic acids. Then the fragmented nucleic acids are 

20 delivered into aqueous microreactors in a water-in-oil emulsion such that a plurality of 
aqueous microreactors comprise a single copy of a fragmented nucleic acid, a single bead 
capable of binding to the fragmented nucleic acid, and amplification reaction solution 
containing reagents necessary to perform nucleic acid amplification. In the next step, the 
fragmented nucleic acids is amplified in the microreactors to form amplified copies of the 

25 nucleic acids and binding the amplified copies to beads in the microreactors. Next, the 
beads are delivered to an array of at least 10,000 reaction chambers on a planar surface, 
wherein a plurality of the reaction chambers comprise no more than a single bead. 
Finally, a sequencing reaction is performed simultaneously on a plurality of the reaction 
chambers. 

30 Another embodiment of the invention is directed to an array comprising a planar 

surface with a plurality of cavities thereon, each cavity forming an analyte reaction 
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chamber, wherein ihe reaction chambers have a center to center spacing of between 20 to 
100 urn and each cavity has a width in at least one dimension of between 20 urn and 
70 urn. Further, there are at least 10,000 reaction chambers in the array. Each reaction 
chambers may contain at least 100,000 copies of a single species of single stranded 

5 nucleic acid template. 

Another embodiment of the invention is directed to an array comprising a planar 
top surface and a planar bottom surface wherein the planar top surface has at least 10,000 
cavities thereon, each cavity forming an analyte reaction chamber, the planar bottom 
surface is optically conductive such that optical signals from the reaction chambers can 

10 be detected through the bottom planar surface, wherein the distance between the top 
surface and the bottom surface is no greater than 5 mm, wherein the reaction chambers 
have a center to center spacing of between 20 to 100pm and each chamber having a width 
in at least one dimension of between 20pm and 70pm. The distance between the top 
surface and the bottom surface, in one embodiment, is no greater than 2 mm. 

15 Another embodiment of the invention is directed to an array means for carrying 

out separate parallel common reactions in an aqueous environment. The array means 
may comprise a substrate comprising at least 10,000 discrete reaction chambers 
containing a starting material that is capable of reacting with a reagent, each of the 
reaction chambers being dimensioned such that when one or more fluids containing at 

20 least one reagent is delivered into each reaction chamber, the diffusion time for the 
reagent to diffuse out of the well exceeds the time required for the starting material to 
react with the reagent to form a product. 

Another embodiment of the invention is directed to a method for delivering a 
bioactive agent to an array. The method comprises dispersing over the array a plurality 

25 of mobile solid supports, each mobile solid support having at least one reagent 
immobilized thereon, wherein the reagent is suitable for use in a nucleic acid sequencing 
reaction, where the array comprises a planar surface with a plurality of reaction chambers 
disposed thereon. The the reaction chambers may have a center to center spacing of 
between 20 to 100 pm and each reaction chamber has a width in at least one dimension of 

30 between 20 pm and 70 pm. 
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Another embodiment of the invention is directed to an apparatus for 
simultaneously monitoring an array of reaction chambers for light indicating that a 
reaction is taking place at a particular site. The apparatus comprises (a) an array of 
reaction chambers formed from a planar substrate comprising a plurality of cavitated 
5 surfaces, each cavitated surface forming a reaction chamber adapted to contain analytes, 
and wherein the reaction chambers have a center to center spacing of between 20 to 
100 jim, each reaction chamber having a volume of between 10 to 150 pL, the array 
comprising more than 10,000 discrete reaction chambers; (b) an optically sensitive device 
arranged so that in use the light from a particular reaction chamber will impinge upon a 
10 particular predetermined region of the optically sensitive device; (c) means for 
determining the light level impinging upon each of the predetermined regions; and (d) 
means to record the variation of the light level with time for each of the reaction 
chamber. 

Another embodiment of the invention is directed to an analytic sensor, comprising 

15 (a) an array formed from a first bundle of optical fibers with a plurality of cavitated 
surfaces at one end thereof, each cavitated surface forming a reaction chamber adapted to 
contain analytes, and wherein the reaction chambers have a center to center spacing of 
between 20 to 100 urn, a width of 20 to 70 (im, the array comprising more than 10,000 
discrete reaction chambers; (b) an enzymatic or fluorescent means for generating light in 

20 the reaction chambers; and (c) light detection means comprising a light capture means 
and a second fiber optic bundle for transmitting light to the light detecting means, the 
second fiber optic bundle being in optical contact with the array, such that light generated 
in an individual reaction chamber is captured by a separate fiber or groups of separate 
fibers of the second fiber optic bundle for transmission to the light capture means. 

25 Another embodiment of the invention is directed to a method for carrying out 

separate parallel common reactions in an aqueous environment. The first step involves 
delivering a fluid containing at least one reagent to an array, wherein the array comprises 
a substrate comprising at least 10,000 discrete reaction chambers, each reaction chamber 
adapted to contain analytes, and wherein the reaction chambers have a volume of 

30 between 10 to 150 pL and containing a starting material that is capable of reacting with 
the reagent, each of the reaction chambers being dimensioned such that when the fluid is 
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delivered into each reaction chamber, the diffusion time for the reagent lo diffuse out of 
the well exceeds the time required for the starting material to react with the reagent to 
form a product. The second step involves washing the fluid from the array in the time 
period (i) after the starting material has reacted with the reagent to form a product in each 
5 reaction chamber but (ii) before the reagent delivered to any one reaction chamber has 
diffused out of that reaction chamber into any other reaction chamber. 

Another embodiment of the invention is directed to a method for delivering 
nucleic acid sequencing enzymes to an array. The array having a planar surface with a 
plurality of cavities thereon, each cavity forming an analyte reaction chamber, wherein 

10 the reaction chambers have a center to center spacing of between 20 to lOOjim. The 
method involves the step of dispersing over the array a plurality of mobile solid supports 
having one or more nucleic acid sequencing enzymes immobilized thereon, such that a 
plurality of the reaction chambers contain at least one mobile solid support. 

Another embodiment of the invention is directed to a method for delivering a 

15 plurality of nucleic acid templates to an array. The array may have a planar surface with 
a plurality of cavities thereon, each cavity forming an analyte reaction chamber, wherein 
the reaction chambers have a center to center spacing of between 20 to 100 |im and the 
array having at least 10,000 reaction chambers. The method comprise the step of 
dispersing over the array a plurality of mobile solid supports, each mobile solid support 

20 having no more than a single species of nucleic acid template immobilized thereon, the 
dispersion causing no more than one mobile solid support to be disposed within any one 
reaction chamber. 

Another embodiment of the invention is directed to a method for sequencing a 
nucleic acid. The method comprises the step of providing a plurality of single-stranded 

25 nucleic acid templates disposed within a plurality of cavities on a planar surface, each 
cavity forming an analyte reaction chamber, wherein the reaction chambers have a center 
to center spacing of between 20 to 100|im and the planar surface has at least 10,000 
reaction chambers. The next step involves performing a pyrophosphate based sequencing 
reaction simultaneously on all reaction chambers by annealing an effective amount of a 

30 sequencing primer to the nucleic acid templates and extending the sequencing primer 
with a polymerase and a predetermined nucleotide triphosphate to yield a sequencing 
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product and. if the predetermined nucleotide triphosphate is incorporated onlo the 3* end 
of the sequencing primer, a sequencing reaction byproduct. The third step involves 
identifying the sequencing reaction byproduct, thereby determining the sequence of the 
nucleic acid in each reaction chamber. 
5 Another embodiment of the invention is directed to a method of determining the 

base sequence of a plurality of nucleotides on an array. The first step involves providing 
at least 10,000 DNA templates, each separately disposed within a plurality of cavities on 
a planar surface, each cavity forming an analyte reaction chamber, wherein the reaction 
chambers have a center to center spacing of between 20 to 100 ^m, and a volume of 
10 between 10 to 150 pL. The second step involves adding an activated nucleotide 5- 
triphosphate precursor of one known nitrogenous base to a reaction mixture in each 
reaction chamber, each reaction mixture comprising a template-directed nucleotide 
polymerase and a single-stranded polynucleotide template hybridized to a complementary 
oligonucleotide primer strand at least one nucleotide residue shorter than the templates to 
15 form at least one unpaired nucleotide residue in each template at the y-end of the primer 
strand, under reaction conditions which allow incorporation of the activated nucleoside 
SMriphosphate precursor onto the 3 ? -end of the primer strands, provided the nitrogenous 
base of the activated nucleoside 5' -triphosphate precursor is complementary to the 
nitrogenous base of the unpaired nucleotide residue of the templates. The third step 
20 involves detecting whether or not the nucleoside 5* -triphosphate precursor was 
incorporated into the primer strands in which incorporation of the nucleoside 5'- 
triphosphate precursor indicates that the unpaired nucleotide residue of the template has a 
nitrogenous base composition that is complementary to that of the incorporated 
nucleoside 5'-triphosphate precursor. The fourth step involves sequentially repeating 
25 steps (b) and (c), wherein each sequential repetition adds and, detects the incorporation of 
one type of activated nucleoside 5^ -triphosphate precursor of known nitrogenous base 
composition. The fifth step involves determining the base sequence of the unpaired 
nucleotide residues of the template in each reaction chamber from the sequence of 
incorporation of the nucleoside precursors. 
30 Another embodiment of the invention is directed to. a method of identifying the 

base in a target position in a DNA sequence of template DNA. The first step involves 
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providing at least 10,000 separate DNA templates are separately disposed within a 
plurality of cavities on a planar surface, each cavity forming an analyte reaction chamber, 
wherein the reaction chambers have a center to center spacing of between 20 to 100 (.mi, 
the DNA being rendered single stranded either before or after being disposed in the 
5 reaction chambers. The second step involves providing an extension primer is provided 
which hybridizes to the immobilized single-stranded DNA at a position immediately 
adjacent to the target position. The immobilized single-stranded DNA is subjected to a 
polymerase reaction in the presence of a predetermined deoxynucleotide or 
dideoxynucleotide, wherein if the predetermined deoxynucleotide or dideoxynucleotide is 

10 incorporated onto the 3* end of the sequencing primer then a sequencing reaction 
byproduct is formed. The fourth step involves identifying the sequencing reaction 
byproduct, thereby determining the nucleotide complementary to the base at the target 
position in each of the 10,000 DNA templates. 

Another embodiment of the invention is directed to an apparatus for analyzing a 

15 nucleic acid sequence. The apparatus comprises: (a) a reagent delivery cuvette, wherein 
the cuvette includes an array comprising a planar surface with a plurality of cavities 
thereon, each cavity forming an analyte reaction chamber, wherein the reaction chambers 
have a center to center spacing of between 20 to 100 pm, and there are in excess of 
10,000 reaction chambers, and wherein the reagent delivery cuvette contains reagents for 

20 use in a sequencing reaction; (b) a reagent delivery means in communication with the 
reagent delivery cuvette; (c) an imaging system in communication with the reagent 
delivery chamber; and (d) a data collection system in communication with the imaging 
system. 

Another embodiment of the invention is directed to an apparatus for determining 
25 the base sequence of a plurality of nucleotides on an array. The apparatus comprises: (a) 
a reagent cuvette containing a plurality of cavities on a planar surface, each cavity 
forming an analyte reaction chamber, wherein there are in excess of 10,000 reaction 
chambers, each having a center to center spacing of between 20 to 100 pm and a volume 
of between 10 to 150 pL; (b) reagent delivery means for simultaneously adding to each 
30 reaction chamber an activated nucleotide SMriphosphate precursor of one known 
nitrogenous base to a reaction mixture in each reaction chamber, each reaction mixture 
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comprising a template-directed nucleotide polymerase and a single-stranded 
polynucleotide template hybridized to a complementary oligonucleotide primer strand at 
least one nucleotide residue shorter than the templates to form at least one unpaired 
nucleotide residue in each template at the 3*-end of the primer strand, under reaction 
5 conditions which allow incorporation of the activated nucleoside 5' -triphosphate 
precursor onto the 3 ? -cnd of the primer strands, provided the nitrogenous base of the 
activated nucleoside 5Mriphosphate precursor is complementary to the nitrogenous base 
of the unpaired nucleotide residue of the templates; (c) detection means for detecting 
in each reaction chamber whether or not the nucleoside 5 "-triphosphate precursor was 

10 incorporated into the primer strands in which incorporation of the nucleoside 5*- 
triphosphate precursor indicates that the unpaired nucleotide residue of the template has a 
nitrogenous base composition that is complementary to that of the incorporated 
nucleoside 5 -triphosphate precursor; and (d) means for sequentially repealing (b) and 
(c), wherein each sequential repetition adds and, detects the incorporation of one type of 

15 activated nucleoside 5' -triphosphate precursor of known nitrogenous base composition; 
and (e) data processing means for determining the base sequence of the unpaired 
nucleotide residues of the template simultaneously in each reaction chamber from the 
sequence of incorporation of the nucleoside precursors. 

Another embodiment of the invention is directed to an apparatus for processing a 

20 plurality of analytes. The apparatus comprises: (a) a flow chamber having disposed 
therein a substrate comprising at least 50,000 cavitated surfaces on a fiber optic bundle, 
each cavitated surface forming a reaction chamber adapted to contain analytes, and 
wherein the reaction chambers have a center to center spacing of between 20 to 100 urn 
and a diameter of 20 to 70 |im: (b) fluid means for delivering processing reagents from 

25 one or more reservoirs to the (low chamber so that the analytes disposed in the reaction 
chambers are exposed to the reagents; and (c) detection means for simultaneously 
detecting a sequence of optical signals from each of the reaction chambers, each optical 
signal of the sequence being indicative of an interaction between a processing reagent 
and the analytc disposed in the reaction chamber, wherein the detection means is in 

30 communication with the cavitated surfaces. 
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Another embodiment of the invention is directed to a method for sequencing a 
nucleic acid. The first step involves providing a plurality of single-stranded nucleic acid 
templates in an array having at least 50,000 discrete reaction sites. The second step 
involves contacting the nucleic acid templates with reagents necessary to perform a 
5 pyrophosphate-based sequencing reaction coupled to light emission. The third step 
involves delecting the light emitted from a plurality of reaction sites on respective 
portions of an optically sensitive device. The forth step involves converting the light 
impinging upon each of the portions of the optically sensitive device into an electrical 
signal which is distinguishable from the signals from all of the other reaction sites. The 

10 fifth step involves determining the sequence of the nucleic acid templates based on light 
emission for each of the discrete reaction sites from the corresponding electrical signal. 

Another embodiment of the invention is directed to a method for sequencing 
nucleic acids. The first step involves fragmenting large template nucleic acid molecules 
to generate a plurality of fragmented nucleic acids. The second step involves attaching 

15 one strand of a plurality of the fragmented nucleic acids individually to beads to generate 
single stranded nucleic acids attached individually to beads. The third step involves 
delivering a population of the single stranded fragmented nucleic acids attached 
individually to beads to an array of at least 10,000 reaction chambers on a planar surface, 
wherein a plurality of the wells comprise no more than a one bead with on single stranded 

20 fragmented nucleic acid. The fourth step involves performing a sequencing reaction 
simultaneously on a plurality of the reaction chambers. The sequencing reaction may 
have the steps of (a) annealing an effective amount of a sequencing primer to the single 
stranded fragmented nucleic acid templates and extending the sequencing primer with a 
polymerase and a predetermined nucleotide triphosphate to yield a sequencing product 

25 and, if the predetermined nucleotide triphosphate is incorporated onto the 3' end of the 
sequencing primer, a sequencing reaction byproduct; and (b) identifying the sequencing 
reaction byproduct, thereby determining the sequence of the nucleic acid in a plurality of 
the reaction chambers. Alternatively, the sequencing reaction may comprises the steps 
of: (a) hybridizing two or more sequencing primers to one or a plurality of single strands 

30 of the nucleic acid molecule wherein all the primers except for one are reversibly blocked 
primers; (b) incorporating at least one base onto the nucleic acid molecule by polymerase 
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elongation from an unblocked primer; (c) preventing further elongation or the unblocked 
primer, (d) deblocking one of the reversibly blocked primers into an unblocked primer; 
and (e) repeating steps (b) to (d) until at least one of the reversibly blocked primers are 
deblocked and used for determining a sequence. 

Other materials and methods may be found in the following copending US patent 
applications: USSN 60/443,471 filed January 29, 2003 and USSN 60/465.071 filed April 
23, 2003. All patents, patent applications, and references cited in this disclosure are 
incorporated herein by reference. 
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EXAMPLES 

Example 1 : Sample Preparation 
DNA Sample: 

5 The DNA should be of high quality and free from contaminants such as proteins, 

nucleases, lipids, and other chemicals (such as residual EDTA from preparation) and 
salts. It is preferred that genomic DNA should have a 260/280 ratio of 1.8 or higher. If it 
is desired to sequence the genome of only one organism, then the DNA should be quality 
checked to ensure that there is no contaminating DNA. For example: a preparation of 

10 human DNA may be checked by PCR to ensure that it is not contaminated by bacterial 
DNA molecules. Another method of checking for contamination is by restriction 
digestion patterns and especially restriction digestion followed by Southern Blot using 
suitable probes known to be specific for an organism (e.g., human or mouse) and a 
second probe known to be specific for a possible contaminating organism (e.g., E. coli). 

15 If it is desired, the DNA should originate from a single clone of the organism (e.g., a 
colony if from bacteria). 
Step 1: DNase I Digestion 

The purpose of the DNase I digestion step is to fragment a large stretch of DNA 
such as a whole genome or a large portion of a genome into smaller species. This 

20 population of smaller-sized DNA species generated from a single DNA template is 
referred to as a "library". Deoxyribonuclease I (DNase I) is an endonuclease which 
cleaves double-stranded template DNA. The cleavage characteristics of DNase I allow 
random digestion of template DNA (i.e., minimal sequence bias) and will result in the 
predominance of blunt-ended, double-stranded DNA fragments when used in the 

25 presence of manganese-based buffers (Melgar and Goldthwait 1968). The digestion of 
genomic templates by DNase I is dependent on three factors: i) quantity of enzyme used 
(units); ii) temperature of digestion (°C); and iii) incubation time (minutes). The DNase 1 
digestion conditions outlined below were optimized to yield DNA libraries in a size range 
from 50-700 base pairs (bp). 

30 1. DNA was obtained and prepared to a concentration of 0.3 mg/ml in Tris- 

HCI (lOmM, pH 7-8). A total of 134 \i\ of DNA (15 \xg) was needed for this preparation. 



87 



Express Mail: EV 399526138 US 21465-510 UTIL 

Mailed January 28, 2004 

It is recommended lo not use DNA preparations diluted with buffers containing EDTA (i.e., 
TE, Tris/EDTA). The presence of EDTA is inhibitory to enzyme digestion with DNase I. 
If the DNA preparation contains EDTA, it is important that the DNA be "salted" out of 
solution and reconstituted with the appropriate Tris-HCI buffer (10 mM, pH 7-8) or 
5 nanopure H 2 0 (pH 7-8). 

2. In a 0.2 ml tube, DNase I Buffer, comprising 50 jaI Tris pH 7.5 (IM)> 10 
\x\ MnCh (1M), 1 \i\ BSA (100 mg/ml), and 39 jil water was prepared. 

3. In a separate 0.2 ml tube, 15 pi of DNase I Buffer and 1.5 \il of DNase 1 
(lU/ml) was added. The reaction tube was placed in a thermal cycler set to 15°C. 

10 4. The 134 jal of DNA (0.3 mg/ml) was added to the DNase I reaction tube 

placed in the thermal cycler set at 15°C. The lid was closed and the sample was incubated 
for exactly 1 minute. Following incubation, 50 (il of 50 mM EDTA was added to stop the 
enzyme digestion. 

5. The digested DNA was purified by using the QiaQuick PCR purification kit. 
15 The digestion reaction was then split into four aliquots, and four spin columns were used to 

purify each aliquot (37.5 \x\ per spin column). Each column was eluted with 30 |il elution 
buffer (EB) according to the manufacturer's protocol. The eluates were then combined to 
generate a final reaction volume of 120 jil. 

6. One 3 jil aliquot of the digestion reaction was saved for analysis using a 
20 BioAnalzyer DNA 1 000 LabChip. 



Step 2: Pfu Polishing 

Digestion of DNA templates with DNase 1 yields fragments of DNA that are 
primarily blunt-ended, however, some fragments will have ends that contain protruding 

25 termini that are one or two nucleotides in length. Pfu polishing is used to increase the 
amount of blunt-ended species by fill-in (i.e., "blunting") of 5' overhangs. Additionally, 
Pfu DNA polymerase has 3'-> 5' exonuclease activity that will result in the removal of 
single and double nucleotide extensions. Pfu polishing increases the amount of blunt- 
ended DNA fragments available for adaptor ligation (Costa 1994a, 1994b, 1994c). The 

30 following Pfu polishing protocol was used. 
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1 . In a 0.2 ml tube, 1 15 \i\ purified, DNase I-digested DNA fragments, 15 \x\ 
1 OX Cloned Pfu buffer, 5 yX dNTPs (10 mM), and 15 ^1 cloned Pfu DNA polymerase 
(2.5 U/|il) were added in order. 

2. The polishing reaction components were mixed well and incubated at 72° 
5 C for 30 minutes. 

3. Following incubation, the reaction tube was removed and placed on ice for 2 
minutes. 

4. The polishing reaction mixture was then split into four aliquots and purified 
using QiaQuick PCR purification columns (37.5 |iL on each column). Each column was 

10 eluted with 30 jjiI buffer EB according to the manufacturer's protocol. The eluatcs were 
then combined to generate a final reaction volume of 120 \iL. 

5. One 3 \x\ aliquot of the final polishing reaction was saved for analysis using 
a BioAnalzyer DNA 1000 LabChip. 

1 5 Step 3: Ligation of Universal Adaptors to Fragmented DNA Library 

Following fragmentation and polishing of the genomic DNA library, primer 
sequences are addecj to the ends of each DNA fragment. These primer sequences are 
termed "Universal Adaptors'' and are comprised of double-stranded oligonucleotides that 
contain specific priming regions that afford both PCR amplification and nucleotide 

20 sequencing. The Universal Adaptors are designed to include a set of unique PCR 
priming regions that are 20 base pairs in length located adjacent to a set of unique 
sequencing priming regions that are 20 base pairs in length, followed by a unique 4-base 
"key" consisting of one of each deoxyribonucleotide (i.e., A, C, G. T). Each unique 
Universal Adaptor (termed "Universal Adaptor A" and "Universal Adaptor B v ) is forty- 

25 four base pairs (44 bp) in length. Universal Adaptors are ligated, using T4 DNA ligasc, 
onto each end of the DNA fragment to generate a total nucleotide addition of 88 bp to 
each DNA fragment. Different Universal Adaptors are designed specifically for each 
genomic DNA library preparation and will therefore provide a unique identifier for each 
organism. 

30 To prepare a pair of Universal Adaptors, single-stranded oligonucleotides are 

designed in-house and are manufactured through a commercial vendor. Universal 
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Adaptor DNA oligonucleotides are designed with two phosphorothioate linkages at each 
oligonucleotide end that serve to protect against nuclease activity (Samini, T.D., B. 
Jolles, and A. Laigle. 2001. Best minimally modified antisense oligonucleotides 
according to cell nuclease activity. Antisense Nucleic Acid Drug Dev. 1 1(3): 129., the 
5 disclosure of which is incorporated in toto herein by reference.). Each oligonucleotide is 
HPLC-purified to ensure there are no contaminating or spurious DNA oligonucleotide 
sequences in the final prep. 

The Universal Adaptors are designed to allow directional ligation to the blunt- 
ended, fragmented genomic DNA. For each Universal Adaptor pair, the PCR priming 

10 region contains a 5' four-base overhang and a blunt-ended 3* Key region. Directionality 
is achieved as the blunt-end side of the Universal Adaptor ligates to the blunt-ended DNA 
fragment while the 5' overhang of the adaptor cannot ligate to the blunt-ended DNA 
fragment. Additionally, a 5' biotin is added to the Universal Adaptor B to allow 
subsequent isolation of ssDNA template (Step 8). Each Universal Adaptor is prepared by 

1 5 annealing, in a single tube, the two single-stranded complementary DNA 

oligonucleotides (i.e., one oligo containing the sense sequence and the second oligo 
containing the antisense sequence). The following ligation protocol was used. 

1. In a 0.2 ml tube, 39 \xl nH 2 0 (molecular biology grade water), 25 \i\ 
digested, polished DNA Library, 100 ^1 2X Quick Ligase Reaction Buffer, 20 MMP1 

20 (10 pm/|il) adaptor set, 100:1 ratio, and 16 ^1 Quick Ligase were added in order. The 
ligation reaction was mixed well and incubated at RT for 20 minutes. 

2. The ligation reaction was then removed and a 10-|J aliquot of the ligation 
reaction was purified for use on the BioAnalyzer. A single spin column from the Qiagen 
Min-Elute kit was used. The column was eluted with 10 |il EB according to the 

25 procedure per manufacturers' protocol. A l-|il aliquot of the purified ligation reaction 
was loaded using a BioAnalyzer DNA 1000 LabChip. This purification step is 
recommended as the unpurified ligation reaction contains high amounts of salt and PEG 
that will inhibit the sample from running properly on the BioAnalyzer. 

3. The remainder of the ligation reaction (190 (iL) was used for gel isolation 
30 in Step 4. 
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Step 3a: Microcon Filtration and Adaptor Construction. Total preparation time 
was approximately 25 min. 

The Universal Adaptor ligation reaction requires a 100-fold excess of adaptors. 
To aid in the removal of these excess adaptors, the double-stranded gDNA library is 
5 filtered through a Microcon YM-100 filter device. Microcon YM-100 membranes can be 
used to remove double stranded DNA smaller than 125 bp. Therefore, unbound adaptors 
(44 bp), as well as adaptor dimers (88 bp) can be removed from the li gated gDNA library 
population. The following filtration protocol was used: 

1. The 190 pL of the ligation reaction from Step 4 was applied into an 
1 0 assembled Microcon YM- 1 00 device. 

2. The device was placed in a centrifuge and spun at 5000 x g for 
approximately 6 minutes, or until membrane was almost dry. 

3. To wash, 200 |il of 1 X TE was added. 

4. Sample was spun at 5000 x g for an additional 9 minutes, or until 
1 5 membrane was almost dry. 

5. To recover, the reservoir was inserted into a new vial and spun at 3000 x g 
for 3 minutes. The reservoir was discarded. The recovered volume was approximately 
10 pi. Next, 80 pi TE was added. 

The Adaptors (A and B) were HPLC-purified and modified with phosphorothioate 
20 linkages prior to use. For Adaptor "A" (10 pM), 10 pi of 100 pM Adaptor A (44 bp, 
sense) was mixed with 10 pi of 100 pM Adaptor A (40 bp, antisense), and 30 pi of IX 
Annealing Buffer (Vf = 50 pi) were mixed. The primers were annealed using the 
ANNEAL program on the Sample Prep Labthermal cycler (see below). For Adaptor U B" 
(10 pM), 1 0 pi of 1 00 pM Adaptor B (40 bp, sense) was mixed with 10 pi of 1 00 pM 
25 Adaptor B (44 bp, antisense), and 30 pi of IX Annealing Buffer (V f = 50 pi). The primers 
were annealed using the ANNEAL program on the Sample Prep Lab thermal cycler. 
Adaptor sets could be stored at -20°C until use. 

ANNEAL-A program for primer annealing: 
1. Incubate at 95°C, 1 min: 
30 2. Decrease temperature to 1 5°C, at 0. 1 °C/sec; and 

3. Hold at 15°C 
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There was no orientation required for the genomic DNA insert fragment and the 
adaptors. Fragments could be ligated at either end. Four single-stranded DNA 
oligonucleotides were included in the Universal Adaptor set. Each single-stranded 
5 oligonucleotide was synthesized at 1 micromole scale and HPLC-purified. Each single- 
stranded oligonucleotide included four phosphorothioate linkages at each end. 

Step 4: Gel Electrophoresis and Extraction of Adapted DNA Library 

The Universal Adaptor ligation protocol produces the following: 1) fragmented 
10 DNAs with adaptors on either end; 2) unbound single adaptors; or 3) the formation of 
adaptor dimers. Agarose gel electrophoresis is used as a method to separate and isolate 
the adapted DNA library population from the unligated, single adaptors and adaptor 
dimer populations. The procedure of DNase I digestion of genomic DNA yields a library 
population that ranges from 50-700 bp (Step 1). The addition of the 88-bp Universal 
15 Adaptor set will shift the population to a larger size and will result in a migration profile 
in the size range of approximately 130-800 bp. Adaptor dimers will migrate at 88 bp and 
adaptors unligated will migrate at 44 bp. Therefore, genomic DNA libraries in size 
ranges > 200 bp can be physically isolated from the agarose gel and purified using 
standard gel extraction techniques. Gel isolation of the adapted DNA library will result 
20 in the recovery of a library population in a size range that is >200 bp (size range of 
library can be varied depending on application). The following electrophoresis and 
extraction protocol was used. 

1 . A 2% agarose gel was prepared. 

2. 10 jal of 10X Ready-Load Dye was added to the remaining 90 |il of the 
25 DNA ligation mixture. 

3. The dye/ligation reaction mixture was loaded into the gel using four 
adjacent lanes (25 |il per lane). 

4. 10 \x\ of thelOO bp ladder (0.1 ng/|il) was loaded two lanes away from 
ligation reaction lanes. 

30 5. The gel was run at 100V for 3 hours. 

6. When the gel run was complete, the gel was removed from the gel box and 
transferred to a flat surface covered with plastic wrap. DNA bands were visualized using 
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a hand-held long-wave UV light. Using a sterile, single-use scalpel, the fragment sizes of 
200 - 400 bp were cut out from the agarose gel. Using this approach, libraries with any 
size range can be isolated. It is also possible to isolate more than one size range. Where 
the library size range is 200-900 bp, it is possible to isolate several size ranges from a 
5 single well (i.e., 200-400 bp and 500-700 bp). 

7. The DNA embedded in the agarose gel was isolated using a Qiagen 
MinElute Gel Extraction kit following the manufacturer's instructions. Briefly, Buffer 
QG was added to cover the agarose in the tube. The agarose was allowed to completely 
dissolve. The color of the Buffer QG was maintained by adjusting the pH according to 

10 the Qiagen instructions to minimize sample loss. Two MinElute spin columns (Qiagen) 
were used for purification. The large volume of dissolved agarose required each column 
to be loaded several times. The columns were eluded with 10 \i\ of Buffer EB which was 
pre-warmed at 55°C. The eluates were pooled to produce 20 pi of gDNA library. 

8. One 1 |iL aliquot of each isolated DNA library was analyzed using a 
15 BioAnalyzer DNA 1000 LabChip to assess the exact distribution of the DNA library 

population. 

Step 5: Strand Displacement and Extension of Nicked Double Stranded DNA 
Library 

20 Because the DNA oligonucleotides used for the Universal Adaptors are not 

phosphoryiated, gaps are present at the 3 ? junctions of the fragmented gDNAs. These 
two "gaps" or "nicks" can be filled in by using a strand displacing DNA polymerase. 
The polymerase recognizes nicks, displaces the nicked strands, and extends the strand in 
a manner that results in repair of nicks and in the formation of non-nicked double- 

25 stranded DNA. The strand displacing enzyme used is the large fragment of Bst DNA 
polymerase. 

1. In a 0.2 ml tube, 19 |il gel-extracted DNA library, 40 \i\ nH 2 O y 8 fil 10X 
ThermoPol Reaction Buffer, 8 \xl BSA (1 mg/ml), 2 ^1 dNTPs (10 mM), and 3 ^1 Bst I 
Polymerase (8 U/|il) were added in order. 
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2. The samples were mixed well and placed in a thermal cycler and 
incubated using the Strand Displacement incubation program: "BST". BST program for 
stand displacement and extension of nicked double-stranded DNA: 

1 . Incubate at 65° C, 30 minutes; 
5 2. Incubate at 80° C, 10 minutes; 

3. Incubate at 58° C, 1 0 minutes; and 

4. Hold at 14' C. 

3. One 1 \iL aliquot of the Art-treated DNA library was run using a 
BioAnalyzer DNA 1000 LabChip. 

10 

Step 6: Preparation of Streptavidin Beads 

Following the generation of unnicked double-stranded genomic DNA, it is 
necessary to isolate single-stranded genomic DNAs containing flanking Universal 
Adaptor sequences. This step outlines the binding of biotin-tagged double-stranded DNA 
15 to streptavidin beads. For preparing streptavidin beads, the following protocol was used. 

1. 100 \i\ Dynal M-270 Streptavidin beads were washed two times with 200 
Hi of IX Binding Buffer (1 M NaCI, 0.5 mM EDTA, 5 mM Tris, pH 7.5) by applying the 
magnetic beads to the MPC. 

2. The beads were resuspended in 100 jal 2X Binding buffer, then the 
20 remaining 79 \i\ of the Art-treated DNA sample (from Step 5) and 20 ^1 water was added. 

3. The bead solution was mixed well and placed on a tube rotator at RT for 
20 minutes. The bead mixtures were washed, using the MPC, two times with 100 \il of 
IX Binding Buffer, then washed two times with nH 2 0. Binding & Washing (B&W) 
Buffer (2X and IX): 2X B&W buffer was prepared by mixing 10 mM Tris»HCl (pH 

25 7.5), 1 mM EDTA, and 2 M NaCI. The reagents were combined as listed above and 
mixed thoroughly. The solution can be stored at RT for 6 months; IX B&W buffer was 
prepared by mixing 2X B&W buffer with nH 2 0, 1:1. The final concentrations were half 
the above, i.e., 5 mM Tris^HCl (pH 7.5), 0.5 mM EDTA, and 1 M NaCI. 
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Step 7: Isolation of single-stranded DNA Library using Streptavidin Beads 

Following binding of the double-stranded gDNA library to streptavidin beads, it is 
preferred to isolate from the ligated pool only the single-stranded gDNAs containing 
Universal Adaptor A and Universal Adaptor B (desired populations are designated below 
5 with asterisks). Double-stranded genomic DNA fragment pools will have adaptors bound 
in the following possible configurations: 

Universal Adaptor A- gDNA Fragment -Universal Adaptor A 
Universal Adaptor B- gDNA Fragment -Universal Adaptor A* 
Universal Adaptor A- gDNA Fragment -Universal Adaptor B* 
10 Universal Adaptor B- gDNA Fragment -Universal Adaptor B 

Because only the Universal Adaptor B has a 5' biotin moiety, magnetic 
streptavidin-containing beads can be used to bind all gDNA library species that possess 
the Universal Adaptor B. Genomic library populations that contain two Universal 
Adaptor A species (or nonligated species) do not bind to streptavidin-containing beads 
15 and are removed during the wash procedure. The species that remain bound to bead after 
washing include those with Universal Adaptors A and B or those with two Universal 
Adaptor B ends. 

Genomic DNA species with two Universal Adaptor B sequences with two biotin 
molecules can bind to the streptavidin-containing beads at both ends. Species with A and 

20 B adaptors having only a single biotin molecule can bind to the beads only at the "B v 
end. To isolate the single-stranded population, the bead-bound double-stranded DNA is 
treated with a sodium hydroxide solution that serves to disrupt the hydrogen bonding 
between the complementary DNA strands. If the DNA fragment has biotin on each end 
(Universal Adaptor B ends), both resulting single strands remain bound to the beads. If 

25 the fragment has only a single biotin (Universal Adaptors A and B), then the 
complementary strand separates from the DNA-bead complex. 

The resulting single-stranded genomic DNA library is collected from the solution 
phase and is quantitated, e.g., using pyrophosphate sequencing (PyroSequence) or by 
using a RNA Pico 6000 LabChip (Agilent, Palo Alto, CA). Single-stranded genomic 

30 DNA libraries are quanlitatcd by calculating the number of molecules per unit volume. 
Single-stranded gDNA molecules are then annealed (at a half copy per bead to obtain one 
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effective copy per bead) to 25-30 |ini sepharose beads containing DNA capture primers 
(PCR primer B). The templates are then amplified using emulsion polymerase chain 
reaction protocols. Subsequent sequencing may be conducted using known techniques. 
For isolation of the single stranded library, the following protocol was used. 

1. 250 \xl Melt Solution (0.125 M NaOH, 0.1 M NaCl)was added to washed 
beads from Step 6 above. 

2. The bead solution was mixed well and the bead mixture was incubated at 
room temperature for 10 minutes on a tube rotator. 

3. A Dynal MPC (magnetic particle concentrator) was used, the pellet beads 
were carefully removed, and the supernatant was set aside. The 250-|il supernatant 
included the single-stranded DNA library. 

4. In a separate tube, 1250 \i\ PB (from QiaQuick Purification kit) was added 
and the solution was neutralized by adding 9 ^1 of 20% acetic acid. 

5. Using a Dynal MPC, beads from the 250-i.il supernatant including the 
single-stranded gDNA library were pelleted and the supernatant was carefully removed 
and transferred to the freshly prepared PB/acetic acid solution. 

6. The 1500 |il solution was purified using a single QiaQuick purification 
spin column (load sample through same column two times at 750 |il per load). The 
single-stranded DNA library was cluted with 50 |il EB. 

Step 8a: Single-stranded gDNA Quantitation using Pyrophosphate Sequencing. 
Total preparation time was approximately 1 lir. 

1 . In a 0.2 ml tube, the following reagents were added in order: 

25 jal single-stranded gDNA 

1 |il MMP2B sequencing primer 
14 |il Library Annealing Buffer 
40 |il total 

2. The DNA was allowed to anneal using the ANNEAL-S Program (see 
Appendix, below). 
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3. The samples were run on PSQ (pyrophosphate sequencing jig) to 
determine the number of picomoles of template in each sample (see below). Methods of 
sequencing can be found in U.S. Patent 6,274,320; U.S. Patent 4,863,849; U.S. Patent 
6,210,891; and U.S. Patent 6,258,568, the disclosures of which are incorporated in toto 
5 herein by reference. Calculations were performed to determine the number of single- 
stranded gDNA template molecules per microliter. The remaining 25 (iL of prepared 
single-stranded gDNA library was used for amplification and subsequent sequencing 
(approximately 1 x 10 6 reactions). 

10 Step 8b: Single-stranded gDNA Quantitation using RNA Pico 6000 LabChip. Total 
preparation time was approximately 30 minutes. 

1. The mRNA Pico. assay option was selected on the Bio Analyzer (Software 
version 2.12). 

2. An RNA Pico 6000 LabChip was prepared on the BioAualyzer according 
1 5 to the manufacturers' guidelines. 

3. An RNA LabChip ladder (RNA 6000 ladder) was prepared according to 

manufacturer's (Ambion) directions. Briefly, the RNA LabChip ladder, in solution, was 
heated to 70°C for 2 minutes. The solution was chilled on ice for 5 minutes to snap cool 
the ladder. The solution was briefly centrifuged to clear any condensate from tube walls. 
20 The RNA LabChip Ladder was stored on ice and used within one day. 

4. The ssDNA library to be analyzed was run in triplicate, in adjacent lanes, 
using three 1 nl aliquots. 

5. The BioAnalyzer software was used to calculate the concentration of each 
ssDNA library lane (see the Table below and Figure 24. The average of all three lanes 

25 was used to calculate the DNA concentration of the library using the procedure outlined 
below. 

a. The peak integration lower limit line (large dash in Figure 24) was 
moved immediately in front of the library peak (see below). 

b. The peak integration upper limit line (large clash in the Figure 24) 
30 was moved immediately after the library peak. In this way, the peak integration 

line connecting the lower and upper integration lines followed the slope of the 
background. 
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c. The mouse arrow was used to determine the average size of the 
peak in bases (usually near the peaks highest point) or a defined peak was used as 
chosen by the software. 

d. The integrated value was used for the amount of material in the 
peak. The value obtained Jor picograms recovered was converted into molecules 
recovered (see Table, below). The library concentration was then determined 
(molecules per microliter). 

Table 











5 


6 


7 


8 


9 


1 


2 


3 


4 


Average 


Mean 


Mean 


Mean 


Average 




Pft/HL(1) 


Pfi/HL (2) 


PS/ML(3) 


pg/nL 


Size (bp) 1 


Size (bp) 2 


Size (bp) 3 


Size (bp) 


sample 


1633 


1639 


1645 


1639 


435 


435 


432 


434 




10 




11 


12 











10 



Ave MW (g/mole) 


Ave MW 


Library 


13 14 15 


Ribonucleotide 


(g/mole) 




moles/g 


moles/pL 


moleculcs/pL 


328.2 


1 42E+05 


1.64E-09 


7.02E-06 


I.15E-14 


6.93E+09 



As shown in the Table above, the concentration of Library 1 was calculated as 
1639 pg/|.il (Column 5) and the average fragment size was 434 nucleotides (Column 9). 

15 These values were obtained from the Agilent 2100 software as described in Steps (a)-(d), 
above. The average molecular weight (MW) of a ribonucleotide is 328.2 g/mole 
(Column 10). The MW of the average library fragment (1.42 x 10 5 g/mole. Column 11) 
was calculated by multiplying the average fragment length (434) by the average 
ribonucleotide (328.2). The quantitated library (1639 pg/^il) was converted to grams per 

20 microliter (1.64 x 10" 9 g/|il, Column 12). The number of moles per microliter (1.15 x 10" 
14 moles/|iK Column 14) was calculated by dividing the grams per microliter ( 1.64 x 10' 9 
g/|il, Column 12) by the average molecular weight of the library fragments (1.42 x 10 5 , 
Column 11). Finally, the number of molecules per microliter (6.93 x 10° molecules^!, 
Column 1 5) was derived by multiplying the number of moles per microliter ( 1 . 1 5 x 1 0* 14 

25 moles/|il. Column 14) by Avogadro's number (6.02 x 10 23 molecules/mole). 
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25 



The final library concentration was expected to be greater than I x 1 0 X 
molccules/pl. A more important factor for library quality was adaptor dimer 
concentration. In Figure 24, the height of the library peak was determined approximately 
10 fold greater than the adaptor dimer peak (the first peak after the marker). A library of 
5 good quality is expected to have a peak height at least 2 fold greater than the dimer peak. 
It should be noted that the RNA Pico 6000 LabChip provided estimates within 500% 
accuracy of the single-stranded gDNA concentration. Thus, it was important to perform 
an initial sequencing run using a titration of template to determine the number of copies 
per bead (cpb) of input gDNA. The recommended input DNA is 2.5 cpb. 1 cpb, 0.5 cpb, 
10 and 0.1 cpb. This titration was easily checked using the 4slot bead loading chamber on a 
14x43 PTP. 

Step 9: Dilution and Storage of Single-Stranded gDNA library 

The single-stranded gDNA library was eluted and quantitated in Buffer EB. To 
15 prevent degradation, the single-stranded gDNA library was stored frozen at -20°C in the 
presence of EDTA. Afler quantitation, an equal volume of 10 mM TE was added lo the 
library stock. All subsequent dilutions was in TE. The yield was as follows: 

Remaining final volume of ssDNA library following PSQ analysis = 25 pi. 

Remaining final volume of ssDNA library following LabChip analysis = 47 pi. 
20 For the initial stock dilution, single-stranded gDNA library was diluted to 1 00 

million molecules/).il in IX Library-Grade Elution Buffer. Aliquots of single-stranded 
gDNA library were prepared for common use. For this, 200,000 molecu!es/|il were 
diluted in IX Library-Grade Elution Buffer and 20 ^1 aliquots were measured. Single-use 
library aliquots were stored at -20°C. 



Step 10: Emulsion Polymerase Chain Reaction 

Where increased numbers of cpb were preferred, bead emulsion PCR was 
performed as described in U.S. Patent Application Serial No. 06/476,504 filed June 6, 
2003, incorporated herein by reference in its entirety. 



30 
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Reagent preparation 

The Stop Solution (50 mM EDTA) included 100 of 0.5 M EDTA mixed with 
900 \x\ of nH 2 0 to obtain 1 .0 ml of 50 mM EDTA solution. For 1 0 mM dNTPs, (10 ^1 
dCTP (100 mM), 10 til dATP (100 mM), 10 pi dGTP (100 mM). and 10 til dTTP (100 
5 mM) were mixed with 60 j^lI molecular biology grade water. All four 1 00 mM nucleotide 
stocks were thawed on ice. Then, 10 (il of each nucleotide was combined with 60 \x\ of 
nH 2 0 to a final volume of 100 and mixed thoroughly. Next, 1 ml aliquots were 
dispensed into 1 .5 ml microcentrifuge tubes. The stock solutions could be stored at - 
20°C for one year. 

10 The 10 X Annealing buffer included 200 mM Tris (pH 7.5) and 50 mM 

magnesium acetate. For this solution. 24.23 g Tris was added to 800 ml nH 2 0 and the 
mixture was adjusted to pH 7.5. To this solution, 10.72 g of magnesium acetate was 
added and dissolved completely. The solution was brought up to a final volume of 1000 
ml and could be stored at 4°C for 1 month. The 10 X TE included 100 mM Tris*HCI (pH 

15 7.5) and 50 mM EDTA. These reagents were added together and mixed thoroughly. The 
solution could be stored at room temperature for 6 months. 

Example 2: Primer Design 

As discussed above, the universal adaptors are designed to include: 1 ) a set of 

20 unique PCR priming regions that are typically 20 bp in length (located adjacent to (2)): 2) 
a set of unique sequencing priming regions that are typically 20 bp in length; and 3) 
optionally followed by a unique discriminating key sequence consisting of at least one of 
each of the four deoxyribonucleotides (i.e., A, C, G, T). The probability of cross- 
hybridization between primers and unintended regions of the genome of interest is 

25 increased as the genome size increases and length of a perfect match with the primer 
decreases. However, this potential interaction with a cross-hybridizing region (CHR) is 
not expected to produce problems for the reasons set forth below. 

In a preferred embodiment of the present invention, the single-stranded DNA 
library is utilized for PGR amplification and subsequent sequencing. Sequencing 

30 methodology requires random digestion of a given genome into 150 to 500 base pair 
fragments, after which two unique bipartite primers (composed of both a PCR and 
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sequencing region) are ligated onlo the 5' and 3 : ends of Ihe fragments (Figure 25). 
Unlike typical PCR amplifications where an existing section of the genome is chosen as a 
priming site based on melting temperature (T m ), uniqueness of the priming sequence 
within the genome and proximity to the particular region or gene of interest, the disclosed 
5 process utilizes synthetic priming sites that necessitates careful de novo primer design. 

Tetramer Selection: 

Strategies for de novo primer design are found in the published literature 

10 regarding work conducted on molecular tags for hybridization experiments (see, Hensel, 
M. and D. W. Holden, Molecular genetic approaches for the study of virulence in both 
pathogenic bacteria and fungi. Microbiology, 1996. 142(Pt 5): p. 1049-58; Shoemaker, 
D.D., et al., Quantitative phenotypic analysis of yeast deletion mutants using a highly 
parallel molecular bar-coding strategy. Nat Genet, 1996. 14(4): p. 450-6) and PCR/LDR 

1 5 (polymerase chain reaction/ligation detection reaction) hybridization primers (see, Gerry, 
N.P., et al., Universal DNA microarray method for multiplex detection of low abundance 
point mutations. Journal of Molecular Biology, 1999. 292: p. 25 1-262; Witowski, N.E., et 
al M Microarray-based detection of select cardiovascular disease markers. BioTechniques, 
2000. 29(5): p. 936-944.). 

20 The PCR/LDR work was particularly relevant and focused on designing 

oligonucleotide t: zipcodes*\ 24 base primers comprised of six specifically designed 
tetramers with a similar final T m . (see, Gerry, N.P., et ah, Universal DNA microarray 
method for multiplex detection of low abundance point mutations. Journal of Molecular 
Biology, 1999. 292: p. 251-262; U.S. Pat. No. 6,506,594). Tetrameric components were 

25 chosen based on the following criteria: each tetramer differed from the others by at least 
two bases, tetramers that induced self-pairing or hairpin formations were excluded, and 
palindromic (AGCT) or repetitive tetramers (TATA) were omitted as well Thirty-six of 
the 256 (4 4 ) possible permutations met the necessary requirements and were then 
subjected to further restrictions required for acceptable PCR primer design (Table 1). 

30 
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Table 1 



| TT | TC 


TC 


TA 


CT 


CC 


CG 


CA 


GT 


QC 


GG 


GA 


AT 


AC 


AG 


AA 


TT 


tun 


TUC 


IMG 


TTTA 


T7CT 


TTOC 


TTOG 


TTCA 


i IGI 


TTGC 


TTOG 


TTCA 


TTAT 


TTAC 


TOG 


7TAA 


TC 


TOT 


bad 


1CTG 


TCTA 


TOcr 


TCOC 


TOOG 


TOGA 


TOGT 


TOGC 


TOOG 


7CG4 


TCAT 


TC°C 


TCAG 


TCM 


TC 


IGM 


TGTC 


liaid 


IGIA 


TOCT 


TQOC 


TOOG 


TCC4 


TQGT 


TQOC 


TOOG 


TOGA 


TCAT 


TC*C 


TGfiG 


TG*A 


TA 


TATT 


TATC 


TATC 




TACT 


TACC 


TAOG 


TACA 


TACT 


TfiCC 


TAGG 


TAGA 


TAAT 


TAX 


TAAG 


TAAA 


CT 


CTTT 


enc 


cms 


OTA 


ESI 


cnx 


CTDG 


OCA 


C1GT 


croc 


CTOG 


CIGA 


CTAT 


CTX 


cms 


CTM 


cc 


cm 


0OC 


OCTG 


CCTA 


COCT 


COOG 


O0GA 


OOGT 


O0GC 


CCGS 


OOGA 


CCAT 


OCX 


COG 


OG^A 


OG 


OGTT 


OGIC 


OGIG 


OGTA 


09CT 




OQGA 


OQGT 


rm: 


OGQG 


OQGA 


OGAT 


OGAC 


OGAG 


CGAA 


CA 


GATT 


CATC 


CATC 


CATA 


(XT 


exc 


ocg 


E3 


CAGT 


CAGC 


CfiCG 


GfiGA 


CAAT 


ovo 


C¥G 


CAM 


GT 


Gill 


GIIC 


Gl IG 


GTTA 


GTCT 


GTDC 


GTOG 


GIGA 


Hal 


GIGC 


GIQG 


GFGA 


GTAT 


gtx 


GTAG 


GTM 


GC 


GOT 


CEIC 


GUIG 


GCTA 


Goer 


GOX 


GOOG 


GOCA 


GOGT 


GE3 


GOGG 


GOGA 


GOAT 


GCX 


OCX 


GCAA 


OG 


GOT 


GGTC 


GGIG 


GGTA 


G3CT 


GQX 


QG03 


GQCA 


GQGT 


GGGC 


pee? 


GGGA 


OGAT 


QGAC 




GGM 


GA 


GMT 


G47C 


GAPS 


GATA 


OCT 


GOCC 


G^OG 


GACA 


GOGT 


GfiGC 


GOG 


igggg 


GAAT 


cm: 


GVG 


GflM 


AT 


ATTT 


ATTC 


ATTC 


ATTA 


ATCT 


ATOC 


AICG 


ATOA 


ATGT 


ATOC 


ATQG 


ATOA 




ATX 


ATAG 


ATM 


X 


pen 


XTC 


xrc 


ACTA 


AXT 


AOX 


ACOG 


ACDA 


ACGT 


AOGC 


ACGG 


AGGA 


ACAT 




AGOG 


XM 


AG 


A3TT 


AGTC 


fiGVG 


A3TA 


AGCT 


A3X 


AGOG 


AGCA 


AGGT 


A3GC 


AGOG 


AGGA 


AGAT 


A3AC 




^GM 


M 


MJT 


MIC 


MTC 


MTA 


axt 


AAOC 


AACG 


MCA 


ArVST 


A*X 


AAG3 


APGA 


MAT 


AAAC 


AAflG 



5 

The table shows a matrix demonstrating tetrameric primer component selection 
based on criteria outlined by Gerry et al. 1999. J. MoL Bio. 292: 251-262. Each tetramer 
was required to differ from all others by at least two bases. The tetramers could not be 
palindromic or complimentary with any other tetramer. Thirty-six tetramers were 
selected (bold, underlined); italicized sequences signal palindromic tetramers that were 
excluded from consideration. 

Primer Design: 

The PCR primers were designed to meet specifications common to general primer 
design (see, Rubin, E. and A.A. Levy, A mathematical model and a computerized 
simulation of PCR using complex templates. Nucleic Acids Res, 1996. 24(18): p. 3538- 
45; Buck, G.A., et al., Design strategies and performance of custom DNA sequencing 
primers. Biotechniques, 1999. 27(3): p. 528-36), and the actual selection was conducted 
by a computer program, MMP. Primers were limited to a length of 20 bases (5 tetramers) 
for efficient synthesis of the total bipartite PCR/sequencing primer. Each primer 
contained a two base GC clamp on the 5' end, and a single GC clamp on the V end 
(Table 2). and all primers shared similar T m (+/- 2°C) (Figure 27). No hairpinning within 
the primer (internal hairpin stem AG > -1 .9 kcal/mol) was permitted. Dimerization was 
also controlled; a 3 base maximum acceptable dimcr was allowed, but it could occur in 
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final six 3' bases, and the maximum allowable AG for a 3' dimer was -2.0 kcal/mol. 
Additionally, a penalty was applied to primers in which the 3' ends were loo similar to 
others in the group, thus preventing cross- hybridization between one primer and the 
reverse complement of another. 



Table 2 





1-pos 


2-pos 


3-pos 


4-pos 


5-pos 


1 


CCAT 


TGAT 


TGAT 


TGAT 


ATAC 


2 


CCTA 


CTCA 


CTCA 


CTCA 


AAAG 


3 


CGAA 


TACA 


TACA 


TACA 


TTAG 


4 


CGTT 


AGCC 


AGCC 


AGCC 


AATC 


5 


GCAA 


GACC 


GACC 


GACC 


TGTC 


6 


GCTT 


TCCC 


TCCC 


TCCC 


AGTG 


7 


GGAC 


ATCG 


ATCG 


ATCG 


CTTG 


8 


GGTA 


CACG 


CACG 


CACG 


GATG [ 


9 




TGCG 


TGCG 


TGCG 


TCTG 


10 




ACCT 


ACCT 


ACCT 




H 




GTCT 


GTCT 


GTCT 




12 




AGGA 


AGGA 


AGGA 




13 




TTGA 


TTGA 


TTGA 




14 




CAGC 


CAGC 


CAGC 




15 




GTGC 


GTGC 


GTGC 




16 




ACGG 


ACGG 


ACGG 




17 




CTGT 


CTGT 


CTGT 




18 




GAGT 


GAGT 


GAGT 




19 




TCGT 


TCGT 


TCGT 





Table 2 shows possible permutations of the 36 selected tetrads providing two 5 T 
and a single 3' G/C clamp. The internal positions are composed of remaining tetrads. 
This results in 8x19x19x19x9 permutations, or 493,848 possible combinations. 
Figure 27 shows first pass, T m based selection of acceptable primers, reducing field of 
493,848 primers to 56,246 candidates with T m of 64 to 66°C. 
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Table 3 



The probability of perfect sequence nratches fa prirrere ^ 

size of the genorrre of interest 



telch Length 


Perfect match 
probability 
(1/(4Alendh)) 


% chanoe for match in Ajeno - 
35Khases 


% chance fa match in NC8 
bacterial database -488M 
bases 


%chance fa match in rtman 
-3B bases 


20 


9.1E-13 


0.00% 


0.04% 


0.27% 


19 


7.3E-12 


0.00% 


0.68% 


4.32% 


18 


4.4E-11 


0.00% 


5.76% 


34.37% 


17 


23E-10 


0.00% i 


35.69% 


99.17% 


16 


1.2E-09 


0.02% 


97.52% 


>100% 


15 


5.6&09 


0.12% 


>100% 


> 100% 


14 


26B08 


0.64% 


>100% 


>100% 


13 


12E07 


3.29% 


>100% 


>100% 


12 


5.4&07 


15.68% 


>100% 


>100% 


11 


24&06 


58.16% 


>100% 


>100% 


10 


1.0305 


99.35% 


>100% 


>100% 


9 


4.6E05 


99.77% 


>100% 


>100% 


8 


20E-O4 


>100% 


>100% 


>100% 


7 


8.5&04 


>100% 


>100% 


>100% 


6 


3.7&03 


>100% 


>100% 


> 100% 


5 


1.6E-Q2 


>100% 


>100% 


>100% 


4 


6.4&Q2 


>100% 


>100% 


>100% 


3 


25&01 


>100% 


>100% 


>100% 


2 


7.1&01 


>100% 


>100P/o 


>100% 


1 


1.0BO0 


>100% 


>100% 


>100% 



The possibility of complimentary regions occurring within the genome of interest 
was not a major concern in the primer design process despite the reported tolerance of 

5 PCR to mismatches in complex sample populations (see, e.g., Rubin, E. and A. A. Levy. 
A mathematical model and a computerized simulation of PCR using complex templates. 
Nucleic Acids Res, 1996. 24(18): p. 3538-45). Although the probability of finding a 
perfect match to a 20 base primer is extremely low (4 20 ) (Table 3), the probability of 
finding less non-consecutive matches increases significantly with the size of the genome 

10 of interest. As a result, the probability of finding a perfect match of at least 10 of 20 
bases is 99.35% for an Adenovirus genome. The probability of finding a 16 base perfect 
match is 97% for the sequences in the NCBI database (approximately 100 times larger 
than the Adenovirus genome). The probability of finding a 1 7 base perfect match to a 20 
base primer is 99% for the sequences in the human genome (3 billion bases). 

1 5 The high probability of primer cross-hybridization to regions of the genome is 

less problematic than one might expect due to the random ONA digestion used to 
produce the template fragments. Thus, the effects of a cross-hybridizing region (CHR) 
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are fairly benign. It is unlikely that a CHR would be able to successfully compete with 
the perfect match between the PCR primers in solution and the template. In addition, any 
primers thai include mismatches at their 3* end would be at a significant competitive 
disadvantage. Even if a CHR should out compete the intended PCR primer, it would 

5 produce a truncated PCR product, without a downstream site for the sequencing primer. 
If the truncated product could be driven to the capture bead and immobilized, one of two 
situations would result. If the CHR out-competed the solution-phase primer, then the 
immobilized product would lack a sequencing primer binding site, and would result in an 
empty PicoTiter plate (PTP) well. If the CHR out-competed the bead-bound primer, the 

10 sequencing primer would still be present, and the only effect would be a shorter insert. 
Neither result would unduly compromise the sequencing quality. Given the large amount 
of genomic material used in the sample preparation process (currently 25 iig, containing 
5.29 x 10 16 copies of the 35 Kb Adenovirus genome), oversampling can be used to 
provide fragments that lack the complete CHR, and allow standard PCR amplification of 

1 5 the region in question. 

Example 3: Sample Preparation by Nebulization 
Preparation of DNA by Nebulization 

The purpose of the Nebulization step is to fragment a large stretch of DNA such 
20 as a whole genome or a large portion of a genome into smaller molecular species that arc 
amenable to DNA sequencing. This population of smaller-sized DNA species generated 
from a single DNA template is referred to as a library. Nebulization shears double- 
stranded template DNA into fragments ranging from 50 to 900 base pairs. The sheared 
library contains single-stranded ends that are end-repaired by a combination of T4 DNA 
25 polymerase, E. coli DNA polymerase I (Klenow fragment), and T4 polynucleotide 

kinase. Both T4 and Klenow DNA polymerases are used to "fill-in ' 3 ? recessed ends (5" 
overhangs) of DNA via their 5'-3 ? polymerase activity. The single-stranded 3'-5* 
exonuclease activity of T4 and Klenow polymerases will remove 3 ? overhang ends and 
the kinase activity of T4 polynucleotide kinase will add phosphates to 5' hydroxyl 
30 termini. 

The sample was prepared as follows: 
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1 . 15 )ig of gDN A (genomic DNA) was obtained and adjusted to a final 
volume of 100 pi in 10 mM TE (10 mM Tris, 0.1 niM EDTA, pH 7.6; sec reagent list at 
the end of section). The DNA was analyzed for contamination by measuring the O.D. 
260/280 ratio, which was 1.8 or higher. The final gDNA concentration was expected to be 

5 approximately 300 pg/ml. 

2. 1 600 \i\ of ice-cold Nebulization Buffer (see end of section) was added to 
the gDNA. 

3. The reaction mixture was placed in an ice-cold nebulizer (CIS-US, 
Bedford. MA). 

10 4. The cap from a 15 ml snap cap falcon tube was placed over the top of the 

nebulizer (Figure 28A). 

5. The cap was secured with a clean Nebulizer Clamp assembly, consisting 
of the fitted cover (for the falcon tube lid) and two rubber O-rings (Figure 28B). 

6. The bottom of the nebulizer was attached to a nitrogen supply and the 
1 5 entire device was wrapped in parafilm (Figures 28C and 28D). 

7. While maintaining nebulizer upright (as shown in Figure 28D), 50 psi 
(pounds per square inch) of nitrogen was applied for 5 minutes. The bottom of the 
nebulizer was tapped on a hard surface every few seconds to force condensed liquid to 
the bottom. 

20 8. Nitrogen was turned off after 5 minutes. After the pressure had 

normalized (30 seconds), the nitrogen source was remove from the nebulizer. 

9. The parafilm was removed and die nebulizer top was unscrewed. The 
sample was removed and transferred to a L5 ml microcentrifuge tube. 

10. The nebulizer top was reinstalled and the nebulizer was centrifuged at 500 
25 rpm for 5 minutes. 

I L The remainder of the sample in the nebulizer was collected. Total 
recovery was about 700 fxl. 

1 2. The recovered sample was purified using a QIAquick column (Qiagen 
Inc., Valencia, CA) according to manufacturer's directions. The large volume required 
30 the column to be loaded several times. The sample was eluted with 30 jal of Buffer EB 
(10 mM Tris HCI, pH 8.5;supplied in Qiagen kit) which was pre-warmed at 55°C. 
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1 3. The sample was quantitated by UV spectroscopy (2 pi in 1 98 pi water for 

1:100 dilution). 

Enzymatic Polishing 

5 Nebulization of DNA templates yields many fragments of DNA with frayed ends. 

These ends are made blunt and ready for ligation to adaptor fragments by using three 
enzymes, T4 DNA polymerase, £. coli DNA polymerase (Klenow fragment) and T4 
polynucleotide kinase. 

The sample was prepared as follows: 
10 1 . In a 0.2 ml tube the following reagents were added in order: 

28 |il purified, nebulized gDNA fragments 
5 pi water 

5 pi 10 X T4 DNA polymerase buffer 
5 ill BSA(lmg/ml) 
15 2pldNTPs(10mM) 

5 ul T4 DNA polymerase (3 units/uO 
50 pi final volume 

2. The solution of step I was mixed well and incubated at 25°C for 10 
minutes in a MJ thermocycler (any accurate incubator may be used). 
20 3. 1 .25 pi £. coli DNA polymerase (Klenow fragment) (5 units/ml) was 

added. 

4. The reaction was mixed well and incubated in the MJ thermocycler for 10 
minutes at 25°C and for an additional 2 hrs at 16°C. 

5. The treated DNA was purified using a QiaQuick column and eluted with 
25 30 pi of Buffer EB (10 mM Tris HCI, pH 8.5) which was pre- warmed at 55°C. 

6. The following reagents were combined in a 0.2 ml tube: 

30 pi Qiagen purified, polished, nebulized gDNA fragments 
5 pi water 

5 pi 10XT4PNK buffer 
30 5 pi ATP(lOmM) 

5ul T4PNK (10 units/ml) 
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50 ^il final volume 

7. The solution was mixed and placed in a MJ thermal cycler using the T4 
PNK program for incubation al 37°C for 30 minutes, 65°C for 20 minutes, followed by 
storage at 14°C. 

5 8. The sample was purified using a QiaQuick column and eluted in 30 \i\ of 

Buffer EB which was pre-warmed at 55°C. 

9. A 2 jil aliquot of the final polishing reaction was held for analysis using a 
BioAnalyzer DNA 1000 LabChip (see below). 



10 Ligation of Adaptors 

The procedure for ligating the adaptors was performed as follows: 

1 . In a 0.2 ml tube the following reagents were added in order: 
20.6 ^1 molecular biology grade water 

28 f^tl digested, polished gDNA Library 
15 60 ill 2 X Quick Ligase Reaction Buffer 

1 .8 pi MMP (200 pmol/nl) Universal Adaptor set 

9.6 \x\ Quick Ligase 

120 \il total 

The above reaction was designed for 5 jig and was scaled depending on the 
20 amount of gDNA used. 

2. The reagents were mixed well and incubated at 25°C for 20 minutes. The 
tube was on ice until the gel was prepared for agarose gel electrophoresis. 



Gel Electrophoresis and Extraction of Adapted gDNA Library 

25 Nebulization of genomic DNA yields a library population that ranges from 50- 

900 bp. The addition of the 88-bp Universal Adaptor set will shift the population to a 
larger size and will result in a migration profile with a larger size range (approximately 
130-980 bp). Adaptor dimers will migrate at 88 bp and adaptors not ligaled will migrate 
at 44 bp. Therefore, genomic DNA libraries isolated in size ranges >250 bp can be 

30 physically isolated from the agarose gel and purified using standard gel extraction 
techniques. Gel isolation of the adapted gDNA library will result in the recovery of a 
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library population in a size range that is >250 bp (size range of library can be varied 
depending on application). The library size range after ligation of adapters is 1 30 to 980 
bp. It should be noted that the procedure may be adapted for isolation of any band size 
range, such as, for example, 130 to 200 bp, 200 to 400 bp, 250 to 500 bp. 300 to 600 bp, 
5 500 to 700 bp and the like by cutting different regions of the gel. The procedure 
described below was used to isolated fragments of 250 bp to 500 bp. 

A 1 50 ml agarose gel was prepared to include 2% agarose, 1 X TBE. and 4.5 \i\ 
ethidium bromide (10 mg/ml stock). The ligated DNA was mixed with 10X Ready Load 
Dye and loaded onto the gel. In addition, 10 u:l of alOO-bp ladder (0.1 ng/|il) was loaded 

10 on two lanes away from the ligation reaction flanking the sample. The gel was 
electrophoresed at 100 V for 3 hours. When the gel run was complete, the gel was 
removed from the gel box, transferred to a GelDoc, and covered with plastic wrap. The 
DNA bands were visualized using the Prep UV light. A sterile, single-use scalpel, was 
used to cut out a library population from the agarose gel with fragment sizes of 250 - 500 

1 5 bp. This process was done as quickly as possible to prevent nicking of DNA. The gel 
slices were placed in a 1 5 ml falcon tube. The agarose-embedded gDNA library was 
isolated using a Qiagen MinElute Gel Extraction kit. Aliquots of each isolated gDNA 
library were analyzed using a BioAnalyzer DNA 1000 LabChip to assess the exact 
distribution of the gDNA library population. 

20 

Strand Displacement and Extension of the gDNA Library and Isolation of the Single 
Stranded gDNA Library Using Streptavidin Beads 

Strand displacement and extension of nicked double-stranded gDNA library was 
performed as described in Example I, with the exception that the /to-treated samples 

25 were incubated in the thermal cycler at 65° C for 30 minutes and placed on ice until 
needed. Streptavidin beads were prepared as described in Example I , except that the 
final wash was performed using two washes with 200 ^1 IX Binding buffer and two 
washes with 200 jil nhhO. Single-stranded gDNA library was isolated using streptavidin 
beads as follows. Water from the washed beads was removed and 250 |il of Melt 

30 Solution (see below) was added. The bead suspension was mixed well and incubated at 
room temperature for 10 minutes on a tube rotator. In a separate lube. 1 250 jil of PB 
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(from the QiaQuick Purification kit) and 9 \i\ of 20% acetic acid were mixed. The beads 
in 250 jil Melt Solution were pelleted using a Dynal MPC and the supernatant was 
carefully removed and transferred to the freshly prepared PB/acetic acid solution. DNA 
from the 1 500 pi solution was purified using a single MinElute purification spin column. 
5 This was performed by loading the sample through the same column twice at 750 |il per 
load. The single stranded gDNA library was eluted with 15 \il of Buffer EB which was 
pre-warmed at 55°C 

Single Strand gDNA Quantitation and Storage 
10 Single-stranded gDNA was quantitated using RNA Pico 6000 LabChip as 

described in Example 1. In some cases, the single stranded library was quantitated by a 
second assay to ensure the initial Agilent 2100 quantitation was performed accurately. 
For this purpose, RiboGreen quantitation was performed as described (ssDNA 
Quantitation by Fluoromelry) to confirm the Agilent 2100 quantitation. If the two 
15 estimates differed by more than 3 fold, each analysis was repeated. If the quantitation 
showed greater than a 3 fold difference between the two procedures, a broader range of 
template to bead was used. 

Dilution and storage of the single stranded gDNA library was performed as 
described in Example 1. The yield was as follows: 
20 Remaining final volume of ssDNA library following LabChip analysis - 12 

Remaining final volume of ssDNA library following RiboGreen analysis = 9 \il 
Final volume of ssDNA library after the addition of TE = 1 8 
An equal volume of TE was added to single-stranded gDNA library stock. 
Single-stranded gDNA library to 1 x 10 8 molecules/^tl in Buffer TE. Stock was diluted 
25 (1/500) to 200,000 molecules/^! in TE and 20 pi aliquots were prepared. 

Library Fragment Size Distribution After Nehulization 

Typical results from Agilent 2100 DNA 1000 LabChip analysis of 1 jil of the 
material following Nebulization and polishing are shown in Figure 29A. The size range 
30 distribution of the majority of the product was expected to fall around 50 to 900 base 
pairs. The mean size (lop of peak) was expected to be approximately 450 bp. Typical 
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results from gel purification of adaptor ligated library fragments are shown in Figure 
29B. 

Reagents 

5 Unless otherwise specified, the reagents listed in the Examples represent standard 

reagents that are commercially available. For example, Klenow, T4 DNA polymerase, 
T4 DNA polymerase buffer, T4 PNK, T4 PNK buffer, Quick T4 DNA Ligase, Quick 
Ligation Buffer, Bst DNA polymerase (Large Fragment) and ThermoPol reaction buffer 
are available from New England Biolabs (Beverly, MA). dNTP mix is available from 

10 Pierce (Rockford, 1L). Agarose, UltraPure TBE, BlueJuice gel loading buffer and Ready- 
Load lOObp DNA ladder maybe purchased from Invitrogen (Carlsbad, CA). Ethidium 
Bromide and 2-Propanol may be purchased from Fisher (Hampton, NH). RNA Ladder 
may be purchased from Ambion (Austin, TX). Other reagents are either commonly 
known and/or are listed below: 



15 

Melt Solution: 



Ingredient 


Quantity Required 


Vendor 


Stock Number 


NaCl (5 M) 


200 pi 


Invitrogen 


24740-011 


NaOH(lON) 


125 ul 


Fisher 


SS255-1 


molecular biology grade water 


9.675 ml 


Eppendorf 


0032-006-205 



The Melt Solution included 100 mM NaCl, and 125 mM NaOH. The listed 
reagents were combined and mixed thoroughly. The solution could be stored at RT for 
20 six months. 



Binding & Washing (B&W) Buffer (2X and IX): 



Ingredient 


Quantity Required 


Vendor 


Stock Number 


UltraPure Tris-HCl (pH 7.5, 1 M) 


250 pi 


Invitrogen 


15567-027 


EDTA (0.5 M) 


50 pi 


Invitrogen 


15575-020 


NaCl (5 M) 


10 ml 


Invitrogen 


24740-01 1 


molecular biology grade water 


14.7 ml 


Eppendorf 


0032-006-205 



The 2X B&W buffer included final concentrations of 10 mM Tris-HCl (pH 7.5), 1 
25 mM EDTA, and 2 M NaCl. The listed reagents were combined by combined and mixed 
thoroughly. The solution could be stored at RT for 6 months. The 1 X B&W buffer was 
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prepared by mixing 2X B&W buffer with picopure H 2 0, 1:1. The final concentrations 
was half of that listed the above, i.e., 5 mM Tris-HCl (pH 7.5), 0.5 inM EDTA, and 1 M 
NaCI. 

Other buffers included the following. IX T4 DNA Polymerase Buffer: 50 mM 
5 NaCI, 1 0 mM Tris-HCl, 1 0 mM MgCI2, I mM dithiothreitol (pH 7.9 @ 25°C). TE: 1 0 
mM Tris, 1 mM EDTA. 



Special Reagent preparation: 
TE (lOmM): 



Ingredient 


Quantity Required 


Vendor 


Stock Number 


TE(1M) 


1 ml 


Fisher 


BP1338-1 


molecular biology grade water 


99 ml 


Eppendorf 


0032-006-205 



10 



Reagents were mixed and the solution could be stored RT for six months. 



Nebulization Buffer: 



Ingredient 


Quantity Required 


Vendor 


Stock Number 


Glycerol 


53.1 ml 


Sigma 


G5516 


molecular biology grade water 


42.1 ml 


Eppendorf 


0032-006-205 


UltraPure Tris-HCl (pH 7.5, 1M) 


3.7 ml 


Invitrogen 


15567-027 


EDTA (0.5M) 


1.1 ml 


Sigma 


M- 10228 



] 5 All reagents were added (glycerol was added last) to a Stericup and mixed well. 

The solution was labeled and could be stored at RT for six months. 



ATP(lOmM): 



Ingredient 


Quantity Required 


Vendor 


Stock Number 


ATP (100 mM) 


10 Ml 


Roche 


1140965 


molecular biology grade water 


90 jtl 


Eppendorf 


0032-006-205 


The reagents were mixed and the solution could be stored at -20°C for six months. 


BSA(1 mg/ml): 








Ingredient 


Quantity Required 


Vendor 


Stock Number 


BSA (10 mg/ml) 


10 j.t! 


NEB 


M0203 kit 


Molecular Biology Grade water 


90 Ml 


Eppendorf 


0032-006-205 



The reagents were mixed and the solution could be stored at 4°C for six months. 
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Library Annealing buffer, 10X: 



Ingredient 


Quantity Req. 


Vendor 


Stock No. 


UltraPure Tris-HCl (pH 7.5, 1 M) 


200 ml 


Invitrogen 


15567-027 


Magnesium acetate, enzyme grade (1 M) 


10.72 g 


Fisher 


BP-2 15-500 


Molecular Biology Grade water 


~ 1 L 


Eppendorf 


0032-006-205 



The 10 X Annealing Buffer included 200 mM Tris (pH 7.5) and 50 mM 
5 magnesium acetate. For this buffer, 200 ml of Tris was added to 500 ml picopure H2O. 
Next, 10.72 g of magnesium acetate was added to the solution and dissolved completely. 
The solution was adjusted to a final volume of 1000 ml. The solution could be stored at 
4°C for six months. To avoid the potential for contamination of libraries, the buffer was 
aliquotted for single or short-term usage. 

10 

Adaptors: 



Adaptor "A" (400 jiM): 



- Ingredient 


Quantity Req. 


Vendor 


Stock No. 


Adaptor A (sense; HPLC-purified, 
phosphorothioate linkages. 44 bp. 1000 pmol/ul) 


10.0 pi 


IDT 


custom 


Adaptor A (antisense; HPLC-purified, 
Phosphorothioate linkages, 40 bp. 1000 pmol/fil) 


10.0 |il 


IDT 


custom 


Annealing buffer (1 OX) 


2.5 nl 


454 Corp. 


previous 
table 


molecular biology grade water 


2.5 |il 


Eppendorf 


0032-006-205 



15 For this solution, 10 pi of 1000 pmol/pl Adaptor A (44 bp, sense) was mixed with 

10 |xl of 1000 pmol/pl Adaptor A (40 bp, antisense), 2.5 pi of 10X Library Annealing 
Buffer, and 2.5 |il of water (Vf = 25 pi). The adaptors were annealed using the 
ANNEAL-A program (see Appendix, below) on the Sample Prep Lab thermal cycler. 
More details on adaptor design are provided in the Appendix. 



20 

Adaptor "B" (400 jiM): 



Ingredient 


Quantity Req. 


Vendor 


Stock No. 


Adaptor B (sense; HPLC-purified, 
phosphorothioate linkages. 40 bp, 1000 pmol/jil)) 


10 pi 


IDT 


Custom 


Adaptor B (anti; HPLC -purified, phosphorothioate 
linkages, 5'Biolinylated, 44 bp. 1000 pmol/pl) 


10 f.il 


IDT 


Custom 
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Ingredient 


Quantity Req. 


Vendor 


Stock No. 


Annealing buffer (10X) 


2.5 m.1 


454 Corp. 


previous tabic 


molecular biology grade water 


2.5 yi\ 


Eppendorf 


0032-006-205 



For this solution, 1 0 of 1 000 pmol/|il Adaptor B (40 bp, sense) was mixed with 
10 nl of 1000 pmol/nl Adaptor B (44 bp, anti), 2.5 \i\ of 10 X Library Annealing Buffer, 
and 2.5 |il of water (Vf = 25 \x\). The adaptors were annealed using the ANNEAL-A 
5 program (see Appendix) on the Sample Prep Lab thermal cycler. After annealing, 

adaptor "A" and adaptor U B" (Vf = 50 jil) were combined. Adaptor sets could be stored 
at -20°C until use. 



20% Acetic Acid: 



Ingredient 


Quantity Required 


Vendor 


Stock Number 


acetic acid, glacial 


2ml 


Fisher 


A35-500 


molecular biology grade water 


8 ml 


Eppendorf 


0032-006-205 



10 

For this solution, glacial acetic acid was added to the water.. The solution could 
be stored at RT for six months. 

Adaptor Annealing Program: 

1 5 ANNEAL-A program for primer annealing: 

1. Incubate at 95°C. 1 min; 

2. Reduce temperature to 1 5°C at 0. 1 °C/sec; and 

3. Holdatl4°C. 

20 T4 Polymerase/Klenow POLISH program for end repair: 

1. Incubate at 25°C, 10 minutes; 

2. Incubate at 16°C, 2 hours; and 

3. HoIdat4°C 

25 T4 PNK Program for end repair: 

1 . Incubate at 37°C, 30 minutes; 

2. Incubate at 65°C, 20 minutes; and 

3. Hold at 14°C. 
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BST program for sland displacement and extension of nicked double-stranded gDNA: 

1 . Incubate at 65° C, 30 minutes: and 

2. Hold at 14' C. 

5 

Step 9: Dilution and Storage of Single-Stranded DNA library 

Single-stranded DNA library in EB buffer: remaining final volume = 25 |il. 

Initial Stock dilution was made as follows. Using Pyrosequencing 
(Pyrosequencing AB, Uppsala, Sweden) results, single-stranded DNA library was diluted 
10 to 100M molecules/|iL in IX Annealing Buffer (usually this was a 1:50 dilution). 

Aliquots of single-stranded DNA Library were made for common use by diluting 
200,000 molecules/|AL in I X Annealing Buffer and preparing 30 \iL aliquots. Store at - 
20°C Samples were utilized in emulsion PCR. 

15 Reagent preparation : 

Stop Solution (50 mM EDTA): 100 jil of 0.5 M EDTA was mixed with 900 \il of 
nH 2 0 to make 1 .0 ml of 50 mM EDTA solution. 

Solution of 10 mM dNTPs included 10 jal dCTP (100 mM), 10 f.il dATP (100 
mM), 10 nl dGTP (100 mM), and 10 \xl dTTP (100 mM), 60 [il Molecular Biology Grade 
20 water. (nH 2 0). All four 1 00 mM nucleotide stocks were thawed on ice. 10 jil of each 
nucleotide was combined with 60 \il of nH 2 0 to a final volume of 100 jil, and mixed 
thoroughly. I ml aliquots were dispensed into 1 .5 ml microcentrifuge tubes, and stored at 
-20°C, no longer than one year. 

25 Annealing buffer, 10X: 10 X Annealing buffer included 200 mM Tris (pH 7.5) 

and 50 mM magnesium acetate. For this solution, 24.23 g Tris was added to 800 ml 
nH 2 0 and adjusted to pH 7.5. To this, 10.72 g magnesium acetate was added and 
dissolved completely. The solution was brought up to a final volume of 1 000 ml. The 
solution was able be stored at 4°C for 1 month. 
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lOx TE: 10 X TE included 100 mM Tris-HCl (pH 7.5). and 50 mM EDTA. 

These reagents were added together and mixed thoroughly. The solution could be stored 

at room temperature for 6 months. 



5 Example 4: Bead Emulsion PCR 

The following procedures, including capture of the template DNA, DNA 
amplification, and recovery of the beads bound to amplified template, can be performed 
in a single tube. The emulsion format ensures the physical separation of the beads into 
100-200 \im "microreactors * within this single tube, thus allowing for clonal 

1 0 amplification of the various templates. Immobilization of the amplification product is 
achieved through extension of the template along the oligonucleotides bound to the DNA 
capture beads. Typical, the copy number of the immobilized template ranges from 10 to 
30 million copies per bead. The DNA capture beads affixed with multiple copies of a 
single species of nucleic acid template are ready for distribution onto PTPs. 

15 The 300,000 75-picoliter wells etched in the PTP surface provide a unique array 

for the sequencing of short DNA templates in a massively parallel, efficient and cost- 
effective manner. However, this requires fairly large quantities (millions of copies) of 
clonal templates in each reaction well. The methods of the invention allow the user to 
clonally amplify single-stranded genomic template species thorough PCR reactions 

20 conducted in standard tubes or microliter plates. Single copies of the template species 
may be mixed with capture beads, resuspended into complete PCR amplification 
solution, and emulsified into microreactors (100 to 200 \im in diameter), after which PCR 
amplification generates 10 7 -fold amplification of the initial template species. This 
procedure is much simpler and more cost-effective than previous methods. 

25 

Binding Nucleic Acid Template to Capture Beads 

This example describes preparation of a population of beads that preferably have 
only one unique nucleic acid template attached thereto. Successful clonal amplification 
depends on the delivery of a controlled number of template species (0.5 to 1) to each 
30 bead. Delivery of excess species can result in PCR amplification of a mixed template 
population, preventing generation of meaningful sequence data while a deficiency of 
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species will result in fewer wells containing template for sequencing. This can reduce the 
extent of genome coverage provided by the sequencing phase. As a result, it is preferred 
that the template concentration be accurately determined through replicated quantitation, 
and that the binding protocol be followed as outlined below. 

5 

Template Quality Control 

The success of the Emulsion PCR reaction is related to the quality of the template 
species. Regardless of the care and detail paid to the amplification phase, poor quality 
templates will impede successful amplification and the generation of meaningful 

10 sequence data. To prevent unnecessary loss of time and money, it is important to check 
the quality of the template material before initiating the Emulsion PGR phase of the 
process. Preferably, the library should pass two quality control steps before it is used in 
Emulsion PCR. Its concentration and the distribution of products it contains should be 
determined. Ideally, the library should appear as a heterogeneous population of 

15 fragments with little or no visible adapter dimers (e.g., -90 bases). Also, amplification 
with PCR primers should result in a product smear ranging, for example, from 300 to 500 
bp. Absence of amplification product may reflect failure to properly ligate the adaptors 
to the template, while the presence of a single band of any size may reflect contamination 
of the template. 

20 

Preparation of the PCR solution 

The main consideration for this phase is to prevent contamination of the PCR 
reaction mixture with stray amplicons. Contamination of the PCR reactions with a 
residual amplicon is one of the critical issues that can cause failure of a sequencing run. 
25 To reduce the possibility of contamination, proper lab technique should be followed, and 
reaction mixture preparation should be conducted in a clean room in a UV-treated 
laminar flow hood. 
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PCR Reaction Mix: 

For 200 ^1 PCR reaction mixture (enough for amplifying 600,000 beads), the 
following reagents were combined in a 0.2 ml PCR tube: 



Table 4 





Stock 


Final 


Microliters 


H1FI Buffer 


10X 


1 X 


20 


treated nucleotides 


lOmM 


1 mM 


20 


Mg 


50 mM 


2mM 


8 


BSA 


10% 


0.1 % 


2 


Tween 80 


1 °/c 


0.01 % 


2 


Ppase 


2U 


0.003 U 


0.333333 


Primer MMPla 


100 


0.625 uM 


1.25 


Primer MMP lb 


IOuM 


0.078 uM 


1.56 


Taq polymerase 


5U 


0.2 U 


8 


Water 






136.6 


Total 






200 



The tube was vortexed thoroughly and stored on ice until the beads arc annealed 
with template. 

DNA Capture Beads : 

10 1. 600,000 DNA capture beads were transferred from the stock tube to a 1.5 

ml microfuge tube. The exact amount used will depend on bead concentration of 
formalized reagent. 

2. The beads were pelleted in a benchtop mini centrifuge and supernatant 
was removed. 

15 3. Steps 4-1 1 were performed in a PCR Clean Room. 

4. The beads were washed with 1 mL of 1 X Annealing Buffer. 

5. The capture beads were pelleted in the microcentrifuge. The tube was 
turned 180° and spun again. 

6. All but approximately 10 jil of the supernatant was removed from the tube 
20 containing the beads. The beads were not disturbed. 

7. 1 mL of IX Annealing Buffer was added and this mixture was incubated 
for 1 minute. The beads were then pelleted as in step 5. 

8. All but approximately 100 |iL of the material from the tube was removed. 
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9. The remaining beads and solution were transferred to a PCR tube. 

10. The 1.5 mL tube was washed with 150 jaL of IX Annealing Buffer by 
pipetting up and down several times. This was added to the PCR tube containing the 
beads. 

5 11. The beads were pelleted as in step 5 and all but 10 jaL of supernatant was 

removed, taking care to not disturb the bead pellet. 

12. An aliquot of quantitated single-stranded template DNA (sstDNA) was 
removed. The final concentration was 200,000-sst DNA molecules/|al. 

13. 3 (il of the diluted sstDNA was added to PCR tube containing the beads. 
10 This was equivalent to 600,000 copies of sstDNA. 

14. The tube was vortex ed gently to mix contents. 

15. The sstDNA was annealed to the capture beads in a PCR thermocycler 
with the program 80Armeal stored in the EPCR folder on the MJ Thermocycler, using the 
following protocol: 

15 • 5 minutes at 65°C; 

• Decrease by 0. 1 °C /sec to 60°C; 

• Hold at 60°C for 1 minute; 

• Decrease by 0. 1 °C /sec to 50°C: 

• Hold at 50°C for 1 minute; 

20 • Decrease by 0. 1 °C/sec to 40°C; 

• Hold at 40°C for 1 minute: 

• Decrease by 0. 1°C /sec to 20°C; and 

• Hold at 10°C until ready for next step. 

In most cases, beads were used for amplification immediately after template 
25 binding. If beads were not used immediately, they should were stored in the template 
solution at 4°C until needed. After storage, the beads were treated as follows. 

16. As in step 6, the beads were removed from the thermocycler, ccntrifuged, 
and annealing buffer was removed without disturbing the beads. 

17. The beads were stored in an ice bucket until emulsificalion (Example 2). 
30 18. The capture beads included, on average, 0.5 to 1 copies of sstDNA bound 

to each bead, and were ready for emulsification. 
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Example 5: Emulsification 

A PCR solution suitable for use in this step is described below. For 200 |il PCR 
reaction mix (enough for amplifying 600K beads), the following were added to a 0.2 ml 
5 PCR tube: 





Stock 


Final 


Microliters 


HIFI Buffer 


10X 


1 X 


20 


treated Nukes 


10 mM 


1 mM 


20 


Mg 


50 mM 


2 mM 


8 


BSA 


10% 


0.1 % 


2 


Tween 80 


1 % 


0.01 % 


2 


Ppase 


2U 


0.003 U 


0.333333 


Primer MMP1 a 


100 yM 


0.625 pM 


1.25 


Primer MMPtb 


10 gM 


0.078 gM 


1.56 


Taq 


5U 


0.2 U 


8 


Water 






136.6 


Total 






200 



This example describes how to create a heat-stable water-in-oil emulsion 
containing about 3,000 PCR microreactors per microliter. Outlined below is a protocol 
10 for preparing the emulsion. 

1. 200 \i\ of PCR solution was added to the 600,000 beads (both components 
from Example 1 ). 

2. The solution was pipetted up and down several times to resuspend the beads. 

3. The PCR-bead mixture was allowed to incubate at room temperature for 2 
1 5 minutes to equi I ibrate the beads with PCR solution. 

4. 400 \x\ of Emulsion Oil was added to a UV-irradiated 2 ml microfuge tube. 

5. An "ampl icon-free" 1/4" stir magnetic stir bar was added to the tube of 
Emulsion Oil. 

An amplicon-free stir bar was prepared as follows. A large stir bar was used to 
20 hold a 1/4" stir bar. The stir bar was then: 

• Washed with DNA-Off (drip or spray); 

• Rinsed with picopure water; 

• Dried with a Kimwipe edge; and 

• UV irradiated for 5 minutes. 
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6. The magnetic insert of a Dynal MPC-S tube holder was removed. The tube of 
Emulsion Oil was placed in the tube holder . The tube was set in the center of a stir plate 
set at 600 rpm. 

7. The tube was vortexed extensively to resuspend the beads. This ensured that 
5 there was minimal clumping of beads. 

8. Using a P-200 pipette, the PCR-bead mixture was added drop-wise to the 
spinning oil at a rate of about one drop every 2 seconds, allowing each drop to sink to the 
level of the magnetic stir bar and become emulsified before adding the next drop. The 
solution turned into a homogeneous milky white liquid with a viscosity similar to 

10 mayonnaise. 

9. Once the entire PCR-bead mixture was been added, the microfuge tube was 
flicked a few times to mix any oil at the surface with the milky emulsion. 

1 0. Stirring was continued for another 5 minutes. 

1 1 . Steps 9 and 10 were repeated. 

15 12. The stir bar was removed from the emulsified material by dragging it out of 

the tube with a larger stir bar. 

13. 10 \\L of the emulsion was removed and placed on a microscope slide. The 
emulsion was covered with a cover slip and the emulsion was inspected at 50X 
magnification (10X ocular and 5X objective lens). A "good" emulsion was expected to 
20 include primarily single beads in isolated droplets (microreactors) of PCR solution in oil. 

14. A suitable emulsion oil mixture with emulsion stabilizers was made as 
follows. The components for the emulsion mixture are shown in Table 5. 



Table 5 



Ingredient 


Quantity 
Required 


Source 


Ref. Number 


Sigma Light Mineral Oil 


94.5 g 


Sigma 


M-5904 


Atlox 4912 


lg 


Uniqenia 


NA 


Span 80 


4.5 g 


Uniqema 


NA 



25 The emulsion oil mixture was made by prewarming the Atlox 4912 to 60°C in a 

water bath. Then, 4.5 grams of Span 80 was added to 94.5 grams of mineral oil to form a 
mixture. Then, one gram of the prewarmed Atlox 4912 was added to the mixture. The 
solutions were placed in a closed container and mixed by shaking and inversion. Any 
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sign that the Allox was settling or solidifying was remedied by warming the mixture to 
60°C, followed by additional shaking. 



Example 6: Amplification 

5 This example describes amplification of the template DNA in the bead - emulsion 

mixture. According to this protocol of the invention, the DNA amplification phase of the 
processtakes 3 to 4 hours. After the amplification is complete, the emulsion may be left 
on the thermocycler for up to 12 hours before beginning the process of isolating the 
beads. PCR thermocycling was performed by placing 50 to 100 |il of the emulsified 

10 reaction mixture into individual PCR reaction chambers (i.e., PCR tubes). PCR was 
performed as follows: 

1. The emulsion was transferred in 50-100 amounts into approximately 
10 separate PCR tubes or a 96-weIl plate using a single pipette tip. For this step, the 
water-in-oil emulsion was highly viscous. 

1 5 2. The plate was sealed, or the PCR tube lids were closed, and the containers 

were placed into in a MJ thermocycler with or without a 96-well plate adaptor. 

3. The PCR thermocycler was programmed to run the following program: 

• I cycle (4 minutes at 94° C) - Hotstart Initiation; 

• 40 cycles (30 seconds at 94°C, 30 seconds at 58°C, 90 seconds at 68°C); 
20 • 25 cycles (30 seconds at 94°C, 6 minutes at 58°C); and 

• Storage at 14°C. 

4. After completion of the PCR reaction, the amplified material was removed in 
order to proceed with breaking the emulsion and bead recovery. 

25 Example 7: Breaking the Emulsion and Bead Recovery 

This example describes how to break the emulsion and recover the beads with 

amplified template thereon. Preferably, the post-PCR emulsion should remain intact. 

The lower phase of the emulsion should, by visual inspection, remain a milky white 

suspension. If the solution is clear, the emulsion may have partially resolved into its 
30 aqueous and oil phases, and it is likely that many of the beads will have a mixture of 

templates. If the emulsion has broken in one or two of the tubes, these samples should 
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not be combined with the others. If the emulsion has broken in all of the tubes, the 
procedure should not be continued. 

1. All PCR reactions from the original 600 |il sample were combined into a 
single 1.5 ml microfuge tube using a single pipette tip. As indicated above, the emulsion 

5 was quite viscous. In some cases, pipetting was repeated several times for each tube. As 
much material as possible was transferred to the 1.5 ml tube. 

2. The remaining emulsified material was recovered from each PCR tube by 
adding 50 nl of Sigma Mineral Oil into each sample. Using a single pipette tip, each tube 
was pipetted up and down a few times to resuspend the remaining material. 

10 3. This material was added to the 1 .5 ml tube containing the bulk of the 

emulsified material. 

4. The sample was vortexed for 30 seconds. 

5. The sample was spun for 20 minutes in the tabletop microfuge tube at 
13.2Krpm in the Eppendorf microcentrifuge. 

15 6. The emulsion separated into two phases with a large white interface. As 

much of the top, clear oil phase as possible was removed. The cloudy material was left in 
the tube. Often a white layer separated the oil and aqueous layers. Beads were often 
observed pelleted at the bottom of the tube. 

7. The aqueous layer above the beads was removed and saved for analysis 
20 (gel analysis, Agilent 21 00, and Taqman). If an interface of white material persisted 

above the aqueous layer, 20 microliters of the underlying aqueous layer was removed. 
This was performed by penetrating the interface material with a pipette tip and 
withdrawing the solution from underneath. 

8. In the PTP Fabrication and Surface Chemistry Room Fume Hood, 1 ml of 
25 Hexanes was added to the remainder of the emulsion. 

9. The sample was vortexed for 1 minute and spun at full speed for I minute. 

10. In the PTP Fabrication and Surface Chemistry Room Fume Hood, the top, 
oil/hexane phase was removed and placed into the organic waste container. . 

11. 1 ml of IX Annealing Buffer was added in 80% Ethanol to the remaining 
30 aqueous phase, interface, and beads. 
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12. The sample was vortexed for 1 minute or until the white substance 
dissolved. 

13. The sample was centrifiiged for 1 minute at high speed. The lube was 
rotated 180 degrees, and spun again for I minute. The supernatant was removed without 

5 disturbing the bead pellet. 

14. The beads were washed with 1 ml of IX Annealing Buffer containing 
0.1% Tween 20 and this step was repeated. 



Example 8: Single Strand Removal and Primer Annealing 

10 If the beads are to be used in a pyrophosphate-based sequencing reaction, then it 

is necessary to remove the second strand of the PCR product and anneal a sequencing 
primer to the single stranded template that is bound to the bead. This example describes 
a protocol for accomplishing that. 

1. The beads were washed with 1 ml of water, and spun twice for I minute. 
15 The tube was rotated 1 80° between spins. After spinning, the aqueous phase was 

removed. 

2. The beads were washed with 1 ml of 1 mM EDTA. The tube was spun as 
in step 1 and the aqueous phase was removed. 

3. 1 ml of 0.1 25 M NaOH was added and the sample was incubated for 8 
20 minutes. 

4. The sample was vortexed briefly and placed in a microcentrifuge. 

5. After 6 minutes, the beads were pelleted as in step 1 and as much solution 
as possible was removed. 

6. At the completion of the 8 minute NaOH incubation. 1 ml of IX 
25 Annealing Buffer was added. 

7. The sample was briefly vortexed, and the beads were pelleted as in step 1. 
As much supernatant as possible was removed, and another 1 ml of IX Annealing buffer 
was added. 

8. The sample was briefly vortexed, the beads were pelleted as in step 1 , and 
30 800 nl of IX Annealing Buffer was removed. 

9. The beads were transferred to a 0.2 ml PCR tube. 
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1 0. The beads were transferred and as much Annealing Buffer as possible was 
removed, without disturbing the beads. 

11. 1 00 |il of 1 X Annealing Buffer was added. 

12. 4 (il of 100 nM sequencing primer was added. The sample was vortexed 
5 just prior to annealing. 

1 3. Annealing was performed in a MJ thermocycler using the "80Annear 
program. 

14. The beads were washed three times with 200 jil of 1 X Annealing Buffer 
and resuspcnded with 100 |il of IX Annealing Buffer. 

10 15. The beads were counted in a Hausser Hemacytometer. Typically, 300,000 

to 500,000 beads were recovered (3,000-5,000 beads/[iL). 

16. Beads were stored at 4°C and could be used for sequencing for 1 week. 



Example 9: Optional Enrichment Step 

15 The beads may be enriched for amplicon containing bead using the following 

procedure. Enrichment is not necessary but it could be used to make subsequent 
molecular biology techniques, such as DNA sequencing, more efficient. 

Fifty microliters of 10 \xM (total 500 pmoles) of biotin-sequencing primer was 
added to the Sepharose beads containing amplicons from Example 5. The beads were 

20 placed in a thermocycler. The primer was annealed to the DNA on the bead by the 
thermocycler annealing program of Example 2. 

After annealing, the sepharose beads were washed three times with Annealing 
Buffer containing 0.1% Tween 20. The beads, now containing ssDNA fragments 
annealed with biotin-sequencing primers, were concentrated by centrifiigation and 

25 resuspended in 200 |il of BST binding buffer. Ten microliters of 50,000 unit/ml Bst- 
polymerase was added to the resuspended beads and the vessel holding the beads was 
placed on a rotator for five minutes. Two microliters of lOmM dNTP mixture (i.e., 2.5 \x\ 
each of 10 mM dATP, dGTP, dCTP and dTTP) was added and the mixture was incubated 
for an additional 10 minutes at room temperature. The beads were washed three times 

30 with annealing buffer containing 0. 1 % Tween 20 and resuspended in the original volume 
of annealing buffer. 
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Fifty microliters of Dynal Streptavidin beads (Dynal Biotech Inc., Lake Success, 
NY; M270 or MyOne™ beads at 10 mg/ml) was washed three times with Annealing 
Buffer containing 0.1% Tween 20 and resuspended in the original volume in Annealing 
Buffer containing 0.1% Tween 20. Then the Dynal bead mixture was added to the 
5 resuspended sepharose beads. The mixture was vortexed and placed in a rotator for 10 
minutes at room temperature. 

The beads were collected on the bottom of the test tube by ccntrifugation at 2300 
g (500 rpm for Eppendorf Centrifuge 5415D). The beads were resuspended in the 
original volume of Annealing Buffer containing 0.1% Tween 20. The mixture, in a test 

10 tube, was placed in a magnetic separator (Dynal). The beads were washed three times 
with Annealing Buffer containing 0.1% Tween 20 and resuspended in the original 
volume in the same buffer. The beads without amplicons were removed by wash steps, 
as previously described. Only Sepharose beads containing the appropriated DNA 
fragments were retained. 

15 The magnetic beads were separated from the sepharose beads by addition of 500 

\i\ of 0.125 M NaOH. The mixture was vortexed and the magnetic beads were removed 
by magnetic separation. The Sepharose beads remaining in solution was transferred to 
another tube and washed with 400 |il of 50 mM Tris Acetate until the pH was stabilized 
at 7.6. 

20 

Example 10: Nucleic Acid Sequencing Using Bead Emulsion PCR 

The following experiment was performed to test the efficacy of the bead emulsion 
PCR. For this protocol, 600,000 Sepharose beads, with an average diameter of 25-35 \m\ 
(as supplied my the manufacturer) were covalently attached to capture primers at a ratio 
25 of 30-50 million copies per bead. The beads with covalently attached capture primers 
were mixed with 1.2 million copies of single stranded Adenovirus Library. The library 
constructs included a sequence that was complimentary to the capture primer on the 
beads. 

The adenovirus library was annealed to the beads using the procedure described in 
30 Example 1. Then, the beads were resuspended in complete PCR solution. The PCR 
Solution and beads were emulsified in 2 volumes of spinning emulsification oil using the 
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same procedure described in Example 2. The emulsified (encapsulated) beads were 
subjected to amplification by PCR as outlined in Example 3. The emulsion was broken 
as outlined in Example 4. DNA on beads was rendered single stranded, sequencing 
primer was annealed using the procedure of Example 5. 
5 Next, 70,000 beads were sequenced simultaneously by pyrophosphate sequencing 

using a pyrophosphate sequencer from 454 Life Sciences (New Haven, CT) (see co- 
pending application of Lohman et al., filed concurrently herewith entitled Methods of 
Amplifying and Sequencing Nucleic Acids" USSN 60/476,592 filed June 6, 2003). 
Multiple batches of 70,000 beads were sequenced and the data were listed in Table 6, 
10 below. 



Table 6 



Alignment Error 
Tolerance 


Alignments 


Coverage 


Inferred 
Read Error 


None 


Single 


Multiple 


Unique 






0% 


47916 


1560 




1110 


54.98% 


0.00% 


5% 


46026 


3450 




2357 


83.16% 


1.88% 


10% 


43474 


6001 


1 


3742 


95.64% 


4.36% 



This table shows the results obtained from BLAST analysis comparing the 
sequences obtained from the pyrophosphate sequencer against Adenovirus sequence. 
15 The first column shows the error tolerance used in the BLAST program. The last column 
shows the real error as determined by direct comparison to the known sequence. 

BEAD EMULSION PCR FOR DOUBLE ENDED SEQUENCING 

20 Example 1 1 : Template Quality Control 

As indicated previously, the success of the Emulsion PCR reaction was found to 
be related to the quality of the single stranded template species. Accordingly, the quality 
of the template material was assessed with two separate quality controls before initiating 
the Emulsion PCR protocol. First, an aliquot of the single-stranded template was run on 
25 the 2100 BioAnalyzer (Agilient). An RNA Pico Chip was used to verify that the sample 
included a heterogeneous population of fragments, ranging in size from approximately 
200 to 500 bases. Second, the library was quantitated using the RiboGreen fluorescence 
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assay on a Bio-Tek FL600 plate fluoronieter. Samples determined lo have DNA 
concentrations below 5 ng/pl were deemed too dilute for use. 

Example 12: DNA Capture Bead Synthesis 

5 Packed beads from a I mL N-hydroxysuccinimide ester (NHS)-activated 

Sepharose HP affinity column (Amersham Biosciences, Piscataway, NJ) were removed 
from the column. The 30 -25 (am size beads were selected by serial passage through 30 
and 25 |im pore filter mesh sections (Sefar America, Depew, NY, USA). Beads that 
passed through the first filter, but were retained by the second were collected and 

10 activated as described in the product literature (Amersham Pharmacia Protocol # 

71 700600 AP). Two different amine-labeled HEG (hexaethyleneglycol) long capture 
primers were obtained, corresponding to the 5' end of the sense and antisense strand of 
the template to be amplified. (5 , -Amine-3 HEG spacers 

gcttacctgaccgacctctgcctatcccctgttgcgtgtc-3'; SEQ ID NO: 12; and 5'-Amine-3 HEG 
1 5 spacers ccaltccccagctcgtcttgccatctgttccctccctgtc-3'; SEQ ID NO: 1 3) (IDT Technologies, 
Coralville, IA, USA). The primers were designed to capture of both strands of the 
amplification products to allow double ended sequencing, i.e., sequencing the first and 
second strands of the amplification products. The capture primers were dissolved in 20 
mM phosphate buffer, pH 8.0, to obtain a final concentration of 1 mM. Three microliters 
20 of each primer were bound to the sieved 30 - 25 jam beads. The beads were then stored 
in a bead storage buffer (50 mM Tris, 0.02% Tween and 0.02% sodium azide, pH 8). 
The beads were quantitated with a hemacytometer (Hausser Scientific, Horsham, PA ? 
USA) and stored at 4°C until needed. 

25 Example 13: PGR Reaction Mix Preparation and Formulation 

As with any single molecule amplification technique, contamination of the 
reactions with foreign or residual amplicon from other experiments could interfere with a 
sequencing run. To reduce the possibility of contamination, the PCR reaction mix was 
prepared in a in a UV-treated laminar flow hood located in a PCR clean room. For each 
30 600,000 bead emulsion PCR reaction, the following reagents were mixed in a 1 .5 ml 
tube: 225 pi of reaction mixture (IX Platinum HiFi Buffer (Invitrogen)), 1 mM dNTPs, 
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2.5 mM MgS0 4 (lnvitrogen). 0.1% BSA. 0.01% Tween, 0.003 U/^l thermostable PPi-ase 
(NEB), 0.125 |iM forward primer (5>-gcttacctgaccgacctctg-3'; SEQ ID NO: 14) and 0.125 
\xM reverse primer (5'-ccattccccagctcgtcttg-3'; SEQ ID NO: 15) (IDT Technologies, 
Coralville, IA, USA) and 0.2 V/\xl Platinum Hi-Fi Taq Polymerase (Invitrogen). Twenty- 
5 five microliters of the reaction mixture was removed and stored in an individual 200 pi 
PCR tube for use as a negative control. Both the reaction mixture and negative controls 
were stored on ice until needed. 



Example 14: Binding Template Species to DNA Capture Beads 

1 0 Successful clonal DNA amplification for sequencing relates to the delivery of a 

controlled number of template species to each bead. For the experiments described 
herein below, the typical target template concentration was determined to be 0.5 template 
copies per capture bead. At this concentration, Poisson distribution dictates that 61% of 
the beads have no associated template, 30% have one species of template, and 9% have 

15 two or more template species. Delivery of excess species can result in the binding and 
subsequent amplification of a mixed population (2 or more species) on a single bead, 
preventing the generation of meaningful sequence data. However, delivery of too few 
species will result in fewer wells containing template (one species per bead), reducing the 
extent of sequencing coverage. Consequently, it was deemed that the single-stranded 

20 library template concentration was important. 

Template nucleic acid molecules were annealed to complimentary primers on the 
DNA capture beads by the following method, conducted in a UV-treated laminar flow 
hood. Six hundred thousand DNA capture beads suspended in bead storage buffer (see 
Example 9, above) were transferred to a 200 |il PCR tube. The tube was centrifuged in a 

25 benchtop mini centrifuge for 10 seconds, rotated 180*, and spun for an additional 10 
seconds to ensure even pellet formation. The supernatant was removed, and the beads 
were washed with 200 |il of Annealing Buffer (20 mM Tris, pH 7.5 and 5 mM 
magnesium acetate). The tube was vortexed for 5 seconds to resuspend the beads, and 
the beads were pelleted as before. All but approximately 10 jil of the supernatant above 

30 the beads was removed, and an additional 200 |il of Annealing Buffer was added. The 
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beads were again vortexed for 5 seconds, allowed to sit for 1 minute, and then pelleted as 
before. All but 10 \xl of supernatant was discarded. 

Next, 1.5 |il of 300,000 molecules/|il template library was added to the beads. The 
tube was vortexed for 5 seconds to mix the contents, and the templates were annealed to 

5 the beads in a controlled denaturation/annealing program preformed in an MJ 
thermocycler. The program allowed incubation for 5 minutes at 80°C, followed by a 
decrease by 0.1°C/sec to 70°C, incubation for 1 minute at 70°C, decrease by 0.1°C/sec to 
60°C, hold at 60°C for 1 minute, decrease by 0.1°C/sec to 50°C, hold at 50°C for 1 
minute, decrease by 0.1°C/sec to 20°C, hold at 20°C. Following completion of the 

10 annealing process, the beads were removed from the thermocycler, centrifuged as before, 
and the Annealing Buffer was carefully decanted. The capture beads included on average 
0.5 copy of single stranded template DNA bound to each bead, and were stored on ice 
until needed. 



15 Example 15: Emulsification 

The emulsification process creates a heat-stable water-in-oil emulsion containing 
10,000 discrete PCR microreactors per microliter. This serves as a matrix for single 
molecule, clonal amplification of the individual molecules of the target library. The 
reaction mixture and DNA capture beads for a single reaction were emulsified in the 

20 following manner. In a UV-treated laminar flow hood, 200 [il of PCR solution (from 
Example 10) was added to the tube containing the 600,000 DNA capture beads (from 
Example 1 1). The beads were resuspended through repeated pipetting. After this, the 
PCR-bead mixture was incubated at room temperature for at least 2 minutes, allowing the 
beads to equilibrate with the PCR solution. At the same time, 450 \x\ of Emulsion Oil 

25 (4.5 % (w:w) Span 80, 1 % (w:w) Atlox 4912 (Uniqema, Delaware) in light mineral oil 
(Sigma)) was aliquotted into a flat-topped 2 ml centrifuge tube (Dot Scientific) 
containing a sterile l A inch magnetic stir bar (Fischer). This tube was then placed in a 
custom-made plastic tube holding jig, which was then centered on a Fisher Isotemp 
digital stirring hotplate (Fisher Scientific) set to 450 RPM. 

30 The PCR-bead solution was vortexed for 15 seconds to resuspend the beads. The 

solution was then drawn into a 1 ml disposable plastic syringe (Benton-Dickenson) 
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affixed with a plastic safety syringe needle (Henry Schein). The syringe was placed into 
a syringe pump (Cole-Parmer) modified with an aluminum base unit orienting the pump 
vertically rather than horizontally (Figure 30). The tube with the emulsion oil was 
aligned on the stir plate so that it was centered below the plastic syringe needle and the 
5 magnetic stir bar was spinning properly. The syringe pump was set to dispense 0.6 ml at 
5.5 ml/hr. The PCR-bead solution was added to the emulsion oil in a dropwise fashion. 
Care was taken to ensure that the droplets did not contact the side of the tube as they fell 
into the spinning oil. 

Once the emulsion was formed, great care was taken to minimize agitation of the 

10 emulsion during both the emulsification process and the post-emulsification aliquotting 
steps. It was found that vortexing, rapid pipetting, or excessive mixing could cause the 
emulsion to break, destroying the discrete microreactors. In forming the emulsion, the 
two solutions turned into a homogeneous milky white mixture with the viscosity of 
mayonnaise. The contents of the syringe were emptied into the spinning oil. Then, the 

15 emulsion tube was removed from the holding jig, and gently flicked with a forefinger 
until any residual oil layer at the top of the emulsion disappeared. The tube was replaced 
in the holding jig, and stirred with the magnetic stir bar for an additional minute. The stir 
bar was removed from the emulsion by running a magnetic retrieval tool along the 
outside of the tube, and the stir bar was discarded. 

20 Twenty microliters of the emulsion was taken from the middle of the tube using a 

P100 pipettor and placed on a microscope slide. The larger pipette tips were used to 
minimize shear forces. The emulsion was inspected at SOX magnification to ensure that 
it was comprised predominantly of single beads in 30 to 150 micron diameter 
microreactors of PCR solution in oil (Figure 33). After visual examination, the 

25 emulsions were immediately amplified. 

Example 16: Amplification 

The emulsion was aliquotted into 7-8 separate PCR tubes. Each tube included 
approximately 75 \xl of the emulsion. The tubes were sealed and placed in a MJ 
30 thermocycler along with the 25 jil negative control described above. The following cycle 
times were used: 1 cycle of incubation for 4 minutes at 94°C (Hotstart Initiation), 30 
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cycles of incubation for 30 seconds at 94°C, and 1 50 seconds at 68°C (Amplification), 
and 40 cycles of incubation for 30 seconds at 94°C, and 360 seconds at 68°C 
(Hybridization and Extension). After completion of the PCR program, the tubes were 
removed and the emulsions were broken immediately or the reactions were stored at 10°C 
5 for up to 16 hours prior to initiating the breaking process. 

Example 17: Breaking the emulsion and bead recovery 

Following amplification, the emulstifications were examined for breakage 
(separation of the oil and water phases). Unbroken emulsions were combined into a 

10 single 1.5 ml microcentrifuge tube, while the occasional broken emulsion was discarded. 
As the emulsion samples were quite viscous, significant amounts remained in each PCR 
tube. The emulsion remaining in the tubes was recovered by adding 75 |il of mineral oil 
into each PCR tube and pipetting the mixture. This mixture was added to the 1.5 ml tube 
containing the bulk of the emulsified material. The 1.5 ml tube was then vortexed for 30 

15 seconds. After this, the tube was centrifuged for 20 minutes in the benchtop 
microcentrifuge at 13.2K rpm (full speed). 

After centrifugation, the emulsion separated into two phases with a large white 
interface. The clear, upper oil phase was discarded, while the cloudy interface material 
was left in the tube. In a chemical fume hood, 1 ml hexanes was added to the lower 

20 phase and interface layer. The mixture was vortexed for 1 minute and centrifuged at full 
speed for I minute in a benchtop microcentrifuge. The top, oil/hexane phase was 
removed and discarded. After this, 1 ml of 80% Elhanol/lX Annealing Buffer was added 
to the remaining aqueous phase, interface, and beads. This mixture was vortexed for 1 
minute or until the white material from the interface was dissolved. The sample was then 

25 centrifuged in a benchtop microcentrifuge for 1 minute at full speed. The tube was 

rotated 180 degrees, and spun again for an additional minute. The supernatant was then 
carefully removed without disturbing the bead pellet. 

The white bead pellet was washed twice with 1 ml Annealing Buffer containing 
0.1% Tween 20. The wash solution was discarded and the beads were pelleted after each 

30 wash as described above. The pellet was washed with 1 ml Picopure water. The beads 
were pelleted with the ccntrifugc-rotate-centrifuge method used previously. The aqueous 
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phase was carefully removed. The beads were then washed with 1 ml of I mM EDTA as 
before, except that the beads were briefly vortexed at a medium setting for 2 seconds 
prior to pelleting and supernatant removal. 

Amplified DNA, immobilized on the capture beads, was treated to obtain single 

5 stranded DNA. The second strand was removed by incubation in a basic melt solution. 
One ml of Melt Solution (0.125 M NaOH, 0.2 M NaCl) was subsequently added to the 
beads. The pellet was resuspended by vortexing at a medium setting for 2 seconds, and 
the tube placed in a Thennolyne LabQuake tube roller for 3 minutes. The beads were 
then pelleted as above, and the supernatant was carefully removed and discarded. The 

10 residual Melt solution was neutralized by the addition of 1 ml Annealing Buffer. After 
this, the beads were vortexed at medium speed for 2 seconds. The beads were pelleted, 
and the supernatant was removed as before. The Annealing Buffer wash was repeated, 
except that only 800 nl of the Annealing Buffer was removed after centrifugation. The 
beads and remaining Annealing Buffer were transferred to a 0.2 ml PCR tube. The beads 

1 5 were used immediately or stored at 4°C for up to 48 hours before continuing on to the 
enrichment process. 

Example 18: Optional Bead Enrichment 

The bead mass included beads with amplified, immobilized DNA strands, and 
20 empty or null beads. As mentioned previously, it was calculated that 61% of the beads 
lacked template DNA during the amplification process. Enrichment was used to 
selectively isolate beads with template DNA, thereby maximizing sequencing efficiency. 
The enrichment process is described in detail below. 

The single stranded beads from Example 14 were pelleted with the centrifuge- 
25 rotate-centrifuge method, and as much supernatant as possible was removed without 
disturbing the beads. Fifteen microliters of Annealing Buffer were added to the beads, 
followed by 2 |il of 100 biotinylated, 40 base enrichment primer (5'-Biotin- tetra- 
ethyleneglycol spacers ccattccccagctcgtcttgccatctgttccctccctgtctcag-3'; SEQ ID NO: 16). 
The primer was complimentary to the combined amplification and sequencing sites (each 
30 20 bases in length) on the 3* end of the bead-immobilized template. The solution was 
mixed by vortexing at a medium setting for 2 seconds, and the enrichment primers were 
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annealed to the immobilized DNA strands using a controlled denaturation/annealing 
program in an MJ thermocycler. The program consisted of the following cycle times and 
temperatures: incubation for 30 seconds at 65° C, decrease by 0.1 °C/sec to 58°C 
incubation for 90 seconds at 58° C, and hold at 10° C. 
5 While the primers were annealing, Dynal MyOne™ streptavidin beads were 

resuspend by gentle swirling. Next, 20 |.il of the MyOne™ beads were added to a 1.5 ml 
microcentrifuge tube containing 1 ml of Enhancing fluid (2 M NaCl, 10 mM Tris-HCl, 1 
mM EDTA, pH 7.5). The MyOne bead mixture was vortexed for 5 seconds, and the tube 
was placed in a Dynal MPC-S magnet. The paramagnetic beads were pelleted against the 

10 side of the microcentrifuge tube. The supernatant was carefully removed and discarded 
without disturbing the MyOne™ beads. The tube was removed from the magnet, and 100 
\x\ of enhancing fluid was added. The tube was vortexed for 3 seconds to resuspend the 
beads, and stored on ice until needed. 

Upon completion of the annealing program, 100 |.tl of annealing buffer was added 

15 to the PCR tube containing the DNA capture beads and enrichment primer. The tube 
vortexed for 5 seconds, and the contents were transferred to a fresh 1.5 ml 
microcentrifuge tube. The PCR tube in which the enrichment primer was annealed to the 
capture beads was washed once with 200 |il of annealing buffer, and the wash solution 
was added to the 1.5 ml tube. The beads were washed three times with 1 ml of annealing 

20 buffer, vortexed for 2 seconds, and pelleted as before. The supernatant was carefully 
removed. After the third wash, the beads were washed twice with 1 ml of ice cold 
Enhancing fluid. The beads were vortexed, pelleted, and the supernatant was removed as 
before. The beads were resuspended in 150 jil ice cold Enhancing fluid and the bead 
solution was added to the washed MyOne™ beads. 

25 The bead mixture was vortexed for 3 seconds and incubated at room temperature 

for 3 minutes on a LabQuake tube roller. The streptavidin-coated MyOne™ beads were 
bound to the biotinylated enrichment primers annealed to immobilized templates on the 
DNA capture beads. The beads were then centrifuged at 2,000 RPM for 3 minutes, after 
which the beads were vortexed with 2 second pulses until resuspended. The resuspended 

30 beads were placed on ice for 5 minutes. Following this, 500 |il of cold Enhancing fluid 
was added to the beads and the tube was inserted into a Dynal MPC-S magnet. The 
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beads were left undisturbed for 60 seconds to allow pelleting against the magnet. After 
this, the supernatant with excess MyOne™ and null DNA capture beads was carefully 
removed and discarded. 

The tube was removed from the MPC-S magnet, and I ml of cold enhancing fluid 
5 added to the beads. The beads were resuspended with gentle finger flicking. It was 
important not to vortex the beads at this time, as forcefiil mixing could break the link 
between the MyOne™ and DNA capture beads. The beads were returned to the magnet, 
and the supernatant removed. This wash was repeated three additional times to ensure 
removal of all null capture beads. To remove the annealed enrichment primers and 

10 MyOne™ beads, the DNA capture beads were resuspended in 400 ^1 of melting solution, 
vortexed for 5 seconds, and pelleted with the magnet. The supernatant with the enriched 
beads was transferred to a separate 1.5 ml microcentrifuge tube. For maximum recovery 
of the enriched beads, a second 400 |al aliquot of melting solution was added to the tube 
containing the MyOne™ beads. The beads were vortexed and pelleted as before. The 

1 5 supernatant from the second wash was removed and combined with the first bolus of 
enriched beads. The tube of spent MyOne™ beads was discarded. 

The microcentrifuge tube of enriched DNA capture beads was placed on the 
Dynal MPC-S magnet to pellet any residual MyOne™ beads. The enriched beads in the 
supernatant were transferred to a second 1.5 ml microcentrifuge tube and centrifuged. 

20 The supernatant was removed, and the beads were washed 3 times withl ml of annealing 
buffer to neutralize the residual melting solution. After the third wash, 800 \il of the 
supernatant was removed, and the remaining beads and solution were transferred to a 0.2 
ml PCR tube. The enriched beads were centrifuged at 2,000 RPM for 3 minutes and the 
supernatant decanted. Next. 20 jal of annealing buffer and 3 jil of two different 100 nM 

25 sequencing primers (5'-ccatctgttccctccclgtc-3'; SEQ ID NO: 1 7; and 5 - 

cctatcccctgttgcgtgtc-3' phosphate; SEQ ID NO: 18) were added. The tube was vortexed 
for 5 seconds, and placed in an MJ thermocycler for the following 4-stage annealing 
program: incubation for 5 minutes at 65°C, decrease by 0.1°C/sec to 50°C, incubation for 
1 minute at 50°C, decrease by 0.1°C/sec to 40°C, hold at 40°C for 1 minute, decrease by 

30 0. 1 °C /sec to 1 5°C, and hold at 1 5°C. 
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Upon completion of the annealing program, the beads were removed from 
thermocycler and pelleted by ccntrifugation for 10 seconds. The tube was rotated 180*, 
and spun for an additional 10 seconds. The supernatant was decanted and discarded, and 
200 ^1 of annealing buffer was added to the tube. The beads were resuspended with a 5 
5 second vortex, and pelleted as before. The supernatant was removed, and the beads 
resuspended in 100 ^1 annealing buffer. At this point, the beads were quantitated with a 
Multisizer 3 Coulter Counter (Becknian Coulter). Beads were stored at 4°C and were 
stable for at least 1 week. 

10 Example 19: Double Strand Sequencing 

For double strand sequencing, two different sequencing primers are used; an 
unmodified primer MMP7A and a 3' phosphorylated primer MMP2Bp. There are 
multiple steps in the process. This process is shown schematically in Figure 38. 

1. First Strand Sequencing. Sequencing of the first strand involves extension 
15 of the unmodified primer by a DNA polymerase through sequential addition of 

nucleotides for a predetermined number of cycles. 

2. CAPPING: The first strand sequencing was terminated by flowing a 
Capping Buffer containing 25 mM Tricine, 5 mM Mangesium acetate, 1 mM DTT, 0.4 
mg/ml PVP, 0.1 mg/ml BSA, 0.01% Tween and 2 \xM of each dideoxynucleotides and 2 

20 |iM of each deoxynucleotide. 

3. CLEAN: The residual deoxynucleotides and dideoxynucleotides was 
removed by flowing in Apyrase Buffer containing 25 mM Tricine, 5 mM Magnesium 
acetate, 1 mM DTT, 0.4 mg/ml PVP, 0.1 mg/ml BSA, 0.01% Tween and 8.5 units/L of 
Apyrase. 

25 4. CUTTING: The second blocked primer was unblocked by removing the 

phosphate group from the 3' end of the modified 3* phosphorylated primer by flowing a 
Cutting buffer containing 5 units/ml of Calf intestinal phosphatases. 

5. CONTINUE: The second unblocked primer was activated by addition of 
polymerase by flowing 1000 units/ml of DNA polymerases to capture all the available 

30 primer sites. 
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6. Second Strand Sequencing: Sequencing of the second strand by a DNA 

polymerase through sequential addition of nucleotides for a predetermined number of 

cycles. 

Using the methods described above, the genomic DNA of Staphylococcus aureus 
5 was sequenced. The results are presented in Figure 39. A total of 3 1 ,785 reads were 
obtained based on 15770 reads of the first strand and 160 1 5 reads of the second strand. 
Of these, a total of 1 1,799 reads were paired and 81 87 reads were unpaired obtaining a 
total coverage of 38%. 

Read lengths ranged from 60 to 130 with an average of 95 +/- 9 bases (Figure 40). 
10 The distribution of genome span and the number of wells of each genome span is shown 
in Figure 41. Representative alignment strings, from this genomic sequencing, are shown 
in Figure 42. 

Example 20: Template PCR 

15 30 micron NHS Sepharose beads were coupled with 1 mM of each of the 

following primers: 

MMP1A: cgtttcccctgtgtgccttg (SEQ ID NO: 19) 
MMP1B: ccatctgttgcgtgcgtgtc (SEQ ID NO:20) 

Drive-to-bead PCR was performed in a tube on the MJ thermocycler by adding 50 
20 |il of washed primer-coupled beads to a PCR master mix at a one-to-one volume-to- 
volume ratio. The PCR master mixture included: 

IX PCR buffer; 

1 mMofeachdNTP; 

0.625 nM primer MMP1 A; 
25 0.625 mM primer MMP IB; 

1 ^1 of 1 unit/(.il Hi Fi Taq (Invitrogen, San Diego, CA); and 

- 5-10 ng Template DNA (the DNA to be sequenced). 

The PCR reaction was performed by programming the MJ thermocycler for the 
following: incubation at 94°C for 3 minutes; 39 cycles of incubation at 94°C for 30 
30 seconds, 58°C for 30 seconds, 68°C for 30 seconds; followed by incubation at 94°C for 
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30 seconds and 58°C for 1 0 minutes; 10 cycles of incubation at 94°C for 30 seconds, 
58°C for 30 seconds, 68°C for 30 seconds; and storage at 1 0°C 

Example 21 : Template DNA Preparation and Annealing Sequencing Primer 
5 The beads from Example 1 were washed two times with distilled water; washed 

once with 1 mM EDTA, and incubated with 0. 125 M NaOH for 5 minutes. This removed 
the DNA strands not linked to the beads. Then, the beads were washed once with 50 mM 
Tris Acetate buffer, and twice with Annealing Buffer: 200mM Tris-Acelate, 50mM Mg 
Acetate, pH 7.5. Next, 500 pmoles of Sequencing Primer MMP7A (ccatctgttccctccctgtc; 
1 0 SEQ ID NO:2 1 ) and MMP2B-phos {cctatcccctgttgcgtgtc\ SEQ ID NO:22) were added to 
the beads. The primers were annealed with the following program on the MJ 
thermocycler: incubation at 60°C for 5 minutes; temperature drop of 0.1 degree per 
second to 50°C; incubation at 50°C for 5 minutes; temperature drop of 0.1 degree per 
second to 4°C; incubation at 40°C for 5 minutes; temperature drop of 0.1 degree per 
1 5 second to 10°C. The template was then sequenced using standard pyrophosphate 
sequencing. 

Example 22: Sequencing and Stopping of the First Strand 

The beads were spun into a 55 jam PicoTiter plate (PTP) at 3000 rpm for 10 

20 minutes. The PTP was placed on a rig and run using de novo sequencing for a 

predetermined number of cycles. The sequencing was stopped by capping the first 
strand. The first strand was capped by adding 100 |il of IX AB (50 mM Mg Acetate, 250 
mM Tricine), 1000 unit/ml BST polymerase, 0.4 mg/ml single strand DNA binding 
protein, 1 mM DTT, 0.4 mg/ml PVP (Polyvinyl Pyrolidone), 10 uM of each ddNTP, and 

25 2.5 \M of each dNTP. Apyrase was then flowed over in order to remove excess 
nucleotides by adding IX AB, 0.4 mg/ml PVP, 1 mM DTT, 0.1 mg/ml BSA, 0.125 
units/ml apyrase, incubated for 20 minutes. 
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Example 23: Preparation of Second Strand for Sequencing 

The second strand was unblocked by adding 100 jil of IX AB, 0.1 unit per ml 
poly nucleotide kinase, 5 mM DTT. The resultant template was sequenced using 
standard pyrophosphate sequencing (described, e.g., in US patent 6,274,320, 6258,568 
5 and 6,2 10,891 , incorporated herein by reference). The results of the sequencing method 
can be seen in Figure I OF where a fragment of 1 74 bp was sequenced on both ends using 
pyrophosphate sequencing and the methods described in these examples. 

Example 24: Sequence Analysis of Nucleic Acid on a Picotiter Plate 

10 The picotiter plate containing amplified nucleic acids as described in Example 2 is 

placed in a perfusion chamber. Then sulfiirylase, apyrase, and luciferase are delivered to 
the picotiter plate. 

The sequencing primer primes DNA synthesis extending into the insert suspected 
of having a polymorphism, as shown in Figures 1 1A-I ID. The sequencing primer is first 

15 extended by delivering into the perfusion chamber, in succession, a wash solution, a 

DNA polymerase, and one of dTTP, dGTP, dCTP, or a thio dATP (a dATP analog). The 
sulfurylase, luciferase, and apyrase, attached to the termini convert any PPi liberated as 
part of the sequencing reaction to detectable light. The apyrase present degrades any 
unreacted dNTP. Light is typically allowed to collect for 3 seconds (although 1-100, e.g., 

20 2-10 seconds is also suitable) by a CCD camera linked to the fiber imaging bundle, after 
which additional wash solution is added to the perfusion chamber to remove excess 
nucleotides and byproducts. The next nucleotide is then added, along with polymerase, 
thereby repeating the cycle. 

During the wash the collected light image is transferred from the CCD camera to 

25 a computer. Light emission is analyzed by the computer and used to determine whether 
the corresponding dNTP has been incorporated into the extended sequence primer. 
Addition of dNTPs and pyrophosphate sequencing reagents is repeated until the sequence 
of the insert region containing the suspected polymorphism is obtained. 
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Example 25: On Picotiter Plate PCR Amplification 

Picotiter plale Preparation: In a further embodiment, the single stranded library 
attached to beads arc distributed directly onto the picotiter plate and then the nucleic acid 
template on each bead is amplified (using PCR or other known amplification technology) 
5 to generate sufficient copy number of the template that will generate detectable signal in 
the pyrophosphate-based sequencing methods disclosed herein. 

Example 26: Sequence Analysis of Nucleic Acid on a PTP 

Reagents used for sequence analysis and as controls were the four nucleotides and 

10 0.1 |iM Pyrophosphate (PPi) were made in substrate solution. Substrate solution refers to 
a mixture of 300 jiM Luciferin and 4 \iM adenosine 5'-phosphosulfate, APS, which arc 
the substrates for the cascade of reactions involving PPi, Luciferase and Sulfurylase. The 
substrate was made in assay buffer. The concentration of PPi used to test the enzymes 
and determine the background levels of reagents passing through the chamber was 0.1 

15 fiM. The concentration of the nucleotides, dTTP, dGTP, dCTP was 6.5 jiM and that of 
ctdATP was 50 |iM. Each of the nucleotides was mixed with DNA polymerase, Klenow 
at a concentration of 100 U/mL. 

The PTP was placed in the flow chamber of the embodied instrument, and the 
flow chamber was attached to the faceplate of the CCD camera. The PTP was washed by 

20 flowing substrate (3 ml per min, 2 min) through the chamber. After this, a sequence of 
reagents was flown through the chamber by the pump connected to an actuator, which 
was programmed to switch positions, which had tubes inserted in the different reagents. 
The sequence of reagents, flow rates, and flow times were determined. The camera was 
set up in a fast acquisition mode, with exposure time = 2.5 s. 

25 The signal output from the pad was determined as the average of counts on all the 

pixels within the pad. The frame number was equivalent tothe time passed during the 
experiment. Graphing was used to represent the flow of the different reagents. 

Example 27: Plate-based Platform for Picoliterscale PCR Reactions 
30 Materials and methods 
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Unless otherwise indicated, all common laboratory chemicals were purchased 

either from Sigma (Sigma-Aldrich Corporation, St. Louis, Ml) or Fisher (Fisher 

Scientific, Pittsburgh, PA). 

The PicoTiterPlates™ (25 x 75 x 2 mm) were manufactured by anisotropic 

5 etching of fiber optic faceplates in a manner similar to that previously described 

(Pantano, P. and Walt, D.R., Chemistry' of Materials 1996, 8, 2832-2835). Plates were 
etched in three different microwell depths, 26, 50 and 76 urn. Microwell center-to-center 
pitch was 50 \im, and well diameters ranged between 39 and 44 ^im (See Figure 14) ? with 
a calculated well density of 480 wells/mm 2 . 

10 Solid-phase immobilization of oligonucleotide primers: Packed beads from a 1 

ml NHS-activated Sepharose HP affinity column (Amersham Biosciences, Piscataway, 
NJ) were removed from the column and activated according to the manufacturer* s 
instructions (Amersham Pharmacia Protocol # 7 1 700600AP). Twenty- five microliters of 
a I mM amine-labeled HEG capture primer (5"-Amine-3 hexaethyleneglycol spacers 

15 ccatctgttgcgtgcgtgtc - 3*; SEQ ID NO:23) (IDT Technologies, Coralvillc, IA) in 20 mM 
phosphate buffer pH 8.0 were bound to the beads. After this, 36 to 25 jim beads were 
selected by serial passage through 36 and 25 \xm pore filter mesh sections (Sefar 
America, Depew, NY). DNA capture beads that passed through the first filter, but were 
retained by the second were collected in bead storage buffer (50 mM Tris, 0.02% Tween, 

20 0.02% Sodium Azide, pH 8), quantitated with a hemacytometer (Hausser Scientific, 
Horsham, PA) and stored at 4° C until needed. 

Generation of test DNA fragments: Amplification test fragments were derived 
from a commercially available adenovirus serotype 5 vector, pAdEasy (Stratagene, La 
Jolla, CA). Fragments were amplified using bipartite PCR primers, the 5' end of which 

25 contained a 20 base amplification region, and a 20 base 3" section, complementary to a 
specific region of the adenovirus genome. Using these primers, two fragments were 
amplified from the 12933-13070 and 5659-5767 position of the adenovirus genome and 
assigned labels Fragment A and Fragment B, respectively. 

The sequences for the forward and reverse primers for Fragment A was as 

30 follows. A slash (/) denotes the separation between the two regions of the primer: 
forward {5 - cgtttcccctgtgtgccttg / catcttgtccactaggctct-3 ? ; SEQ ID NO:24-SEQ ID 
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NO:25), and reverse (S'-ccalctgtlgcgtgcgtgtc / accagcactcgcaccacc-3': SEQ ID NO:26- 
SEQ ID NO:27). The primers for the Fragment B included: forward (5 ? - 
cgtttcccctgtgtgccttg / tacctctccgcgtaggcg-3*: SEQ ID NO:28-SEQ ED NO:29), and reverse 
(5*-ccatctgttgcgtgcgtgtc / ccccggacgagacgcag-3*; SEQ ID NO:30-SEQ ID NO:31). 
5 Reaction conditions included 50 mM KCI, 10 mM Tris-HCl (pH 9.0), 0. 1% Triton 

X-100, 2.5 mM MgCI 2 , 0.2 mM dNTP, 1 \xM each forward and reverse primer, 0.1 U/jiI 
Taq (Promega, Madison, Wl) and 50 nmol template DNA. Both templates were 
amplified with a PCR program that include 35 cycles of incubation at 94°C for 30 
seconds, 56°C for 30 seconds, and 72°C for 90 seconds. With PCR primers, the total 
10 length of the amplified fragments was 1 78 bp for Fragment A and 1 48 bp for Fragment 
B. 

To generate fluorescent probes, biotinylated double stranded fluorescent probes 
were prepared by PCR amplification from the pAdEasy vector as described above. 
However, the primer sequences were changed to prevent hybridization between the test 

15 fragment and probe primer regions. In addition, the reverse primers for both fragments 
utilized a 5'biotin followed by 3X hexaethyleneglycol spacers to permit product 
immobilization to beads prior to elution of the single stranded probe. 

The sequence for the forward primer for the fluorescent Fragment A probe was as 
follows. A slash (/) denotes the separation between the two regions of the primer (5*- 

20 atctctgcctactaaccatgaag / catcUgtccactaggctctO 1 ; SEQ ID NO:32-SEQ ID NO:33). The 
sequence for the reverse primer was 5'-biotin -3X hexaethyleneglycol spacers - 
gtttctctccagcctctcaccga / accagcaclcgcaccacc-3 , ; SEQ ID NO:34-SEQ ID NO:35. The 
primers for the Fragment B were as follows: forward (5' -atctctgcctactaaccatgaag/ 
tacctctccgcgtaggcg^; SEQ ID NO:36-SEQ ID NO:37), and reverse (5'-biotin-3x 

25 hexaethyleneglycol spacers - gtttctctccagcctctcaccga / ccccggacgagacgcag-3 ' : SEQ ID 
NO:38-SEQIDNO:39). 

Fluorescent moieties were incorporated through the nucleotide mixture. This 
included 0.2 mM dATP/dGTP/dCTP, 0. 1 5 mM TTP and 0.05 mM Alexa Fluor 488 - 
dUTP (Molecular Probes, Eugene, OR) for Fragment A. Alternately, 0.2 mM 

30 dATP/dGTP/TTP, 0. 15 mM dCTP and 0.05 mM Alexa Fluor 647 - dCTP (Molecular 
Probes, Eugene, OR) was used for amplifying Fragment B. The fluorescent products 
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were purified with a QIAquick PCR Purification Kit (Qiagen, Valencia, CA). The 
biotinylated DNA was subsequently bound to 100 [\\ (approximately 8.1 million) 
Streptavidin Sepharose High Performance beads (Amersham Biosciences) in 1 X binding 
wash (5 mM Tris HC1 pH 7.5, 1 M NaCK 0.5 mM EDTA, 0.05% Tween-20) for 2 hours 

5 at room temperature. After incubation, the beads were washed three times in TE buffer 
(10 mM Tris, 1 mM EDTA, pH 8.0) and incubated with 250 ^il melt solution (0.125 N 
NaOH/0.1 M NaCl) for 2 minutes, releasing the single stranded probe from the beads. 

Beads were pelleted with brief centrifugation in a benchtop centrifuge and the 
supernatant was neutralized in 1 .25 ml buffer PB (Qiagen) with 1 .9 |il glacial acetic acid. 

10 This mixture was repurified on a QiaQuick column (Qiagen), and the concentration of the 
purified probe was determined by TaqMan quantification using the BioRad iCycler 
(BioRad, Hercules, CA). 

Solution-phase PTPCR was performed as follows. The PCR reaction mixture was 
loaded into individual wells of a single 14 mm x 43 mm PicoTiterPlate™. For this, 500 

15 \i\ of PCR reaction mixture (IX Platinum HiFi Buffer (Invitrogen, Carlsbad, CA), 2.5 
mM MgS0 4 , 0.5% BSA, ImM dNTPs (MBI Fermenlas, Hanover, MD), 1 \xM forward 
(5' - cgtttcccctgtgtgccttg - 3 ? ; SEQ ID NO:40) and reverse (5' - ccatctgttgcgtgcgtgtc - 
3'; SEQ ID NO:41) primers, 0.05% Tween-80. 1 l)/\xl Platinum High Fidelity DNA 
Polymerase (Invitrogen), 0.003 U/1.1I Thermostable Pyrophosphatase (USB, Cleveland, 

20 OH), and a calculated 5 copies of Fragment B template per well) were combined in a 1.5 
ml microcentrifuge tube. The tube was vortexed thoroughly and stored on ice until the 
PicoTiterPlate™ loading cartridge was assembled. 

The in-house PicoTiterPlate™ loading cartridge was attached to the 
PicoTiterPlate™ with two plastic clips, seating the silicon cartridge gasket firmly on the 

25 PicoTiterPlate™ surface (see Figure 20). The PCR reaction mix was drawn into a 1 ml 
disposable syringe, and the mouth of the syringe inserted into the input tube of the 
loading cartridge. The loading cartridge was placed on end, so that the input port was 
oriented at the bottom of cartridge, and the PCR mix was slowly loaded into the chamber. 
While loading, inspected through the transparent back of the PicoTiterPlate™ to ensure 

30 even, bubble-free delivery. 
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After loading, (he PCR mixture was allowed to incubate for 5 minutes, at which 
time the reaction mixture was withdrawn from the PicoTiterPlate™ loading cartridge. 
The PicoTiterPlate™ was removed from the loading cartridge, and immediately placed in 
the amplification chamber (see Figure 21 ). The PicoTiterPlate™ surface was covered 

5 with a 0.25 mm thick Silpad A-2000 silicon sheet (The Bergquist Company, Chanhassen, 
MN). On top of this was placed a 25 mm x 75 mm standard glass microscope slide 
(Fisher). A closed cell foam insulation pad (Wicks Aircraft Supply, Highland, IL) was 
placed on top of the microscope slide. An aluminum lid was attached to the base of the 
chamber by six 25 mm bolts sealed the amplification chamber. 

10 Once sealed, the amplification chamber was placed on a Thctmocycler MJ PTC 

225 Tetrad (MJ Research, Waltham, MA) equipped with Flat Block Alpha Units. The 
amplification program included incubation for 3 minutes at 94°C (Hotstart Initiation) 
followed by 40 cycles of incubation for 12 seconds at 94°C, 12 seconds at 58°C 12 
seconds at 68°C, with a 10° C final hold. After completion of the PCR program, the 

15 PicoTiterPlate™ was removed from the amplification chamber, and the loading cartridge 
was reattached. A disposable syringe was used to fill the cartridge chamber with 1 ml of 
H 2 0, and allowed to incubate for 20 minutes 10° C at room temperature. 

After the incubation was completed, the recovery solution was withdrawn from 
the loading cartridge and transferred to a 1 .5 ml microcentrifuge tube. PCR product was 

20 quantitated using an iCycler RealTime PCR unit (BioRad) and FAM-labeled reporter 
probes (Epoch Biosciences, Bothell, WA). The TaqMan Universal PCR MasterMix 
(Applied Biosystems, Foster City, CA) was combined with 0.3 |iM forward and reverse 
primers. 0.15 (iM FAM-labeled probe, and 27 \x\ of the reaction mix added to each well 
of a 96 well PCR plate. 

25 Purified fragments were used to create a standard curve (six standards ranging 

from 1 x 10 9 to 1 x 10 4 molecules per well), which was run in triplicate. The PCR 
amplification was run with the following parameters: incubation for 5 minutes at 94°C 
(hotstart initiation), 60 cycles of incubation for 15 seconds at 94°C, 45 seconds at 68°C 
with a final hold at 4°C. Data was analyzed using the iCycler Optical Systems Software 

30 Version 2.3 (BioRad), and the PCR yield was quantitated using the iCycler data and 
Microsoft Excel (Microsoft, Redmond, WA). 
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Solid-phase PTPCR was performed similarly to solution phase PTPCR, except 
that DNA capture beads were loaded into the PicoTiterPlate™ wells prior to 
amplification by centrifugation as described below. In addition, the PCR mixture was 
loaded into the microwells after the bead deposition was completed. To facilitate 
5 retention of the capture beads during wash steps, the solid phase experiments utilized 50 
\im deep PicoTiterPlate™s. The PicoTiterPlate™ was placed in an in-house built 
plexiglass bead loading jig. This was similar to the PicoTiterPlate™ loading jig 
described in Figure 20, except that the PicoTiterPlate™ was sandwiched between a 
bottom Plexiglas plate and a jig top plate, containing inlet and outlet ports, and sealed via 

10 a si licon gasket with plastic screws. 

Template DNA was preannealed to the DNA capture beads at 5 template copies 
per bead by incubation at 80°C for 3 minutes, after which beads were allowed to cool to 
room temperature for 15 minutes. The beads were then spun into the PicoTiterPlate™ 
wells prior to loading the PCR reaction mixture. Bead Loading Buffer (450 jil; IX 

1 5 Platinum HiFi PCR buffer (Invitrogen), 0.02% Tween-80) containing one hundred 
thousand Sepharose DNA capture beads (approximately 1 bead per 3 PicoTiterPlate™ 
wells) were injected by pipette into the jig through one of the inlet ports. Each inlet hole 
was then sealed with a circular adhesive pad (3M VHS, St. Paul, MN). The jig held the 
PicoTiterPlate™ with its wells facing up and covered with the bead suspension. This was 

20 centriftiged at 2000 rpm for 5 minutes at room temperature in an AUegra 6 centrifuge 
(Beckman Coulter, Fullerton, CA) using a Microtiter Rotor. 

After centrifugation, the PicoTiterPlate™ was removed from the jig. The PCR 
reaction mix was loaded onto the PicoTiterPlate™ as described for solution phase PCR. 
However, the solid-phase PCR mixture omitted template since the template was 

25 preannealed to the DNA capture beads. The solid-phase PCR amplification program 
included additional hybridization/extension cycles to compensate for the slower kinetics 
of the immobilized primer. The program included incubation for 3 minutes at 94°C for 
hotstart initiation, 40 cycles of incubation for 12 seconds at 94°C 12 seconds at 58°C, 12 
seconds at 68°C, followed by 10 cycles of incubation for 12 seconds at 94°C, 10 minutes 

30 at 68°C for hybridization and extension, with a 10°C final hold. 
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Upon completion of the PCR program, the PicoTiterPlate™ was removed from 
the amplification chamber, and washed with 1 ml H 2 0 as described for solution phase 
PCR. The PicoTiterPlate™ was then prepared for hybridization detection of immobilized 
PCR product. 

5 Hybridization was performed with fluoresccntly labeled probes as follows. After 

PTPCR was complete, the strand complementary to the immobilized strand was removed. 
For this, the whole PicoTiterPlate™ was incubated in 0.125 M NaOH for 8 minutes at 
room temperature. This solution was neutralized by two 5 minute washes in 50 ml of 20 
mM Tris-acetate pH 7.5. The PicoTiterPlate™ was then placed in a custom-made 800 pi 

1 0 hybridization chamber, and blocked with hybridization buffer (3.5X SSC, 3.0% SDS, 
20X SSC buffer is 3 M NaCl; 0.3 M Na 3 -citrate) at 65°C for 30 minutes. The contents of 
the chamber were replaced with fresh hybridization buffer containing the probes: 20 nM 
fluorescent Fragment A (Alexa-488) and Fragment B (Alexa-647). The probes were 
allowed to hybridize to their targets. Incubation was carried out at 65°C for 4 hours 

15 while shaking at 200 RPM on an orbital shaker (Barnslead International, Dubuque, I A). 

After hybridization, the PicoTiterPlate™ was washed with 2X SSC, 0.1% SDS for 
15 minutes at 37°C, followed by a 15 minute wash in Ix SSC at 37°C. with two final 15 
minute washes in 0.2x SSC at 37°C. Following post-hybridization washing, the 
PicoTiterPlates ™ were air dried and placed in a FLA-8G00 Fluorescent Image Analyzer 

20 (Fujifilm Medical Systems USA, Stamford, CT) and scanned at the 635 and 473 nm 
wavelength. The resulting 16-bit tiff images were imported into Genepix 4.0 (Axon 
Instruments, Union City, CA). A block of 100 analysis features was drawn over the area 
of interest and the 635 and 473 fluorescence intensities were recorded for each feature. 
Data was then exported to Microsoft Excel for further analysis. 

25 Control beads were prepared as follows. Biolinylated test templates A and B 

were prepared by PCR amplification from the pAdEasy vector, purified, immobilized on 
Streptavidin Sepharose High Performance beads and strand separated as described under 
"Preparation of Fluorescent Probes '. However, fluorescently labeled dNTPs were 
ommitted in the PCR reaction. Pelleted beads were washed 3 times with TE buffer and 

30 stored at 4°C in TE until deposition onto the PicoTiterPlate™. 
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Results 

Solution-phase amplification was demonstrated by loading PicoTitcrPlates with 
PCR master mix containing a calculated 5 template copies per PicoTiterPIate™ well. 
Reactions were run in duplicate in PicoTiterPlates with 26, 50 and 76 |im deep wells. 

5 Forty cycles of PTPCR amplification were performed as described in Material and 

Methods. Additives were incorporated to prevent the deleterious surface effects routinely 
reported with silica reaction vessels (Kalinina, O., et ai, Nucleic Acids Res. 1997, 25, 
1 999-2004; Wittwer, C.T. and Garling, D.J., Bioteclmiques 199 1, 10, 76-83; Taylor, T.B., 
etaL Nucleic Acids Res. 1997, 25, 3164-3168). 

10 The inclusion of 0.5% BSA and 0.05% Tween-80 in the reaction mix was not 

only effective at reducing surface effects, it also facilitated amplification. Reducing the 
relative concentrations of either reagent had a negative effect on amplification. In 
addition, due to the polymerase-inactivating properties of silica surfaces (Taylor, T.B., et 
al y Nucleic Acids Key. 1997, 25, 3164-3168; Shoffner, M.A., Cheng, L, Hvichia, G.E., 

15 Kricka, LJ. and Wilding, P., Nucleic Acids Res. 1996, 24, 375-379), elevated Taq 
concentrations proved beneficial. Concentrations above 1 U/|il were optimum for 
enhancing amplicon yield. 

Following PTPCR, the solution from each PicoTiterPIate™ was recovered and 
triplicate samples of each solution were quantified by TaqMan assay. A standard curve 

20 of diluted template (linear from 1 x 10 9 to 10 4 molecules, r 2 =0.995) was used to 
determine the concentration of the amplified product. The number of molecules 
amplified per well was obtained by dividing the amount of amplified product by the total 
number of wells in a PicoTiterPIate™ (372,380). The amount of amplification per well 
was calculated by dividing this number by the initial template concentration per well. 

25 PTPCR amplification was successful in all of the PicoTiterPIate™, with yields ranging 
from 2.36 x 10 6 fold in the 39.5 pi wells to 1.28 x 10 9 fold in the 50 pi wells (See Table 
below). 



PicoTiterPIate 
Depth [urn] 


Well 

Volume [pi] 


Average Fold 
Amplification N=6 


Fold 

Amplification SD 


Final Product 
Cone. [M] 


26 


39.5 


2.36E+06 


1.02E+06 


4.96E-07 


50 


76.0 


1.28E+09 


1.03E+O9 


1.40E-04 
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76 


115.6 


9.10E+08 


4.95E+08 


6.54E-05 



The tabic shows PicoTiterPlate™ PCR amplification as determined by TaqMan 
Assay. Values reflect triplicate measurements taken from duplicate PicoTiterPlates. 
(N=6); SD = standard deviation. 

5 

Yield was influenced by well volume. The concentration of final product 
obtained for the 50 urn deep wells (1.4 x 10 * M) was significantly greater (p value for 
ANOVA = 0.023) than that obtained in the 76 \xm (6.54 x 10 " 5 M) deep wells, both were 
two orders of magnitude greater than the yield achieved in the 26 \im deep wells (4.96 x 

10 1 0 ' 7 M). The 50 |im deep microwell yield represented the optimal balancing of the costs 
and benefits associated with low-volume PCR. In this case, maximum elevation of the 
effective concentrations and low thermal mass of the reagents were obtained, but the 
surface to volume ratio was still low enough to prevent detrimental surface effects from 
significantly reducing amplification efficiency. 

15 The final concentration of PTPCR product obtained in each of the different well 

depths (4.96 x 10 7 to 1.4 x 10 M) exceeded the 10 ~ 8 M concentration typically reported 
as the maximum achievable before the PCR plateau effect occurs (Sardelli, A., 
Amplifications 1993, 9, 1-5). The higher effective concentration of primers and template 
molecules resulting from the low microwell volume increased the overall reaction 

20 efficiency and postponed the onset of the plateau phase until a higher molar yield was 
achieved. Alternatively, this effect was caused by the high concentration of Taq used in 
the PTPCR reactions, as elevated polymerase concentration has also been shown 
effective in delaying the plateau effect (Kainz, P., Biochim. Biophys. Acta 2000, 1494, 
23-27; Collins, F.S., et a/., Science 2003, 300, 286-290). The amplification efficiency 

25 over 40 cycles was 44.3, 68.9 and 67.5% for the 26, 50 and 76 ^m deep wells 

respectively, providing a high final concentration of amplicons. The greatest yield was 
observed in the 50 |im deep wells. It should be recognized, however, that cycle number 
optimization was not conducted; similar amplification yields could likely have been 
achieved with far fewer cycles, thereby increasing the efficiency of the PTPCR 

30 amplification. 
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The experimental strategy for clonal solid phase PTPCR, starting with a single 
effective copy of a single stranded DNA fragment, and finishing with a specific bead- 
immobilized DNA amplicon detected by fluorescent probe hybridization, is depicted in 
Figure 22 and described in detail below: 
5 Stage I: Each PicoTiterPlate™ well contains PCR reaction mix consisting of a 

single stranded template molecule (cither single stranded and annealed to the DNA 
capture beads, as shown here, or free-floating in solution), Forward "F" (red) and Reverse 
"R" (blue) primers in solution, as well as R primers attached to a DNA capture bead. 
Solution phase primers are present in an 8: 1 molar ratio, with the F primer in excess. 
10 Arrows indicate the 5' -> 3* DNA orientation. 

Stage 2: The initial thermal cycle denatures the DNA template, allowing R 
primers in solution to bind to the complementary region on the template molecule. 
Thermostable polymerases initiate elongation at the primer site (dashed line), and in 
subsequent cycles, solution-phase exponential amplification ensues. Bead immobilized 
15 primers are not assumed to be major contributors to the amplification at this stage. 

Stage 3: Early Phase PCR. During early exponential amplification (1 to 10 
cycles) both F and R primers amplify the template equally, despite an excess of F primers 
in solution. 

Stage 4: Mid Phase PCR. Between cycles 10 and 30, the R primers are depleted, 
20 halting exponential amplification. The reaction then enters an asymmetric amplification 
phase, and the amplicon population becomes increasingly dominated by F strands. 

Stage 5: Late Phase PCR. After 30 to 40 cycles, asymmetric amplification 
continues to increase the concentration of F strands in solution. Excess F strands, without 
R strand complements, begin to anneal to bead-immobilized R primers. Thermostable 
25 polymerases utilize the F strand as a template to synthesize an immobilized R strand of 
the amplicon. 

Stage 6: Final Phase PCR. Continued thermal cycling forces additional 
annealing to bead-bound primers. Solution phase amplification may be minimal at this 
stage, but concentration of immobilized R strands continues to increase. 



149 



Express Mail: EV 399526138 US 21465-510 UTIL 

Mailed January 28, 2004 

Stage 7: The non-immobilized, F strand, complementary to the immobilized R 
strand, is removed by alkali denaturation. The DNA capture beads are now populated by 
single stranded R strands of the amplicon. 

Stage 8: Fluorescently labeled probes (green bars) complementary to the R strand 

5 are annealed to the immobilized strand. Probes specific for particular strand sequences 
are labeled with unique fiuorophorcs, resulting in a range of homogenous and 
heterogeneous fluorescent signals depending on the number of discrete templates 
amplified within a given PicoTiterPlate™ well. 

Initially, fluorescently labeled probe specificity was confirmed by binding 

10 biotinylated Fragment A or Fragment B test DNA fragments to streptavidin Sepharosc 
beads, loading the beads into a 50 |im deep PicoTiterPlate™ by centrifugation and 
hybridizing a mixed population of fluorescently labeled probes for the Fragment A and 
Fragment B fragments. No mixed signals or nonspecific hybridizations were observed; 
the beads with the Fragment A product displayed the 488 nm signal, while the Fragment 

15 B beads exhibited the 635 nm signal (See Figures 23 A and 23B). Close examination of 
Figures 23 A and 23B reveals a few Fragment A beads in the Fragment B pad and vice 
versa. Given the purity of the signal displayed by these nomadic beads, it is likely that 
they are either the product of some cross contamination during the loading process, or 
were washed from one pad to the other during subsequent wash steps. 

20 As indicated in Figure 23C, the fluorescent probes detected successful solid phase 

PTPCR amplification of both Fragment A and Fragment B templates. The signals 
generated by the hybridized probe depended on the relative efficiency of dye 
incorporation within the probes, the sensitivity of the reactions to unequal amounts of 
template DNA, as well as the total and relative amounts of amplified product present on 

25 each bead. In addition, it is likely that the amount of template generated and retained on 
the DNA capture beads varied from well to well, and the number of capture primers 
bound to each bead is also likely to vary due to bead size distribution. As a result, the 
non-normalized ratios generated by the probe hybridization should be seen as semi- 
qualitative rather than quantitative data. Nevertheless, the fluorescent signals generated 

30 by the hybridized probes ranged from a homogeneous Fragment B signal (red) to an 
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equally homogenous Fragment A signal (green), with heterogeneous mixes of the two 
signals (degrees of yellow) evident as well. 

Due to the probe specificity displayed by the controls, as well as the sizeable 
number of homogenous red and green beads on the PicoTiterPIatc™. it is unlikely that 

5 nonspecific probe hybridization caused the heterogeneous signals. The close proximity 
of homogenous beads of either template suggests it is unlikely that the heterogeneous 
beads resulted from amplicon leakage between wells during amplification; if intra-well 
cross-talk were responsible, one would expect to see heterogeneous beads located 
between homogenous beads of either template, and a generally patchy distribution of 

10 homogenous signals. Rather, it is likely that template molecules disassociated from their 
original bead and reannealed to new beads in the PicoTiterPlate™ loading mix prior to 
being spun into the micro wells, or were washed from one bead to another as the PCR mix 
was applied to the PicoTiterPlate™. Regardless of the cause of the mixed template 
beads, the hybridization results show that PCR amplification in the PicoTiterPlate™ 

15 microwells can drive sufficient product to the DNA capture beads to enable fluorescent 
probe hybridization and detection. 



Discussion 

The results in this example demonstrate that PicoTiterPlate™-based PCR 
20 alleviates many factors associated with the DNA amplification process, such as high 
costs of reagents, large numbers of reactions, and lengthy reaction times, delivering 
another "evolutionary jump" in PCR technology. The microwells on a single 
PicoTiterPlate™ can function as up to 370,000 discrete reaction vessels achieving high 
yield (2.3 x 10 6 to 1.2 x 10 9 fold) amplification even at reaction volumes as low as 39.5 
25 picoliters. As a result, throughput is increased, and the total reagent cost for PTPCR is 
reduced; the reaction volume contained in an entire 26 or 76 ^m deep PicoTiterPIatc™ is 
15.3 and 43 \xl respectively. Increases in the size of the PicoTiterPlate™ can further 
increase the maximal throughput. For example, increasing the PicoTiterPlate™ 
dimensions to 40 mm x 75 mm provides approximately 1.4 x 10 6 discrete reaction 
30 vessels, and a PicoTiterPlate™ possessing the same perimeter dimensions as a 
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commercially available 96-well PCR plate (85.47 mm x 127.81 mm) could contain as 
many as 5.24 x 1 0 6 wells. 

Solution phase PCR amplifications, regardless of the number and volume in 
which they are conducted, are of limited utility unless the product can be recovered easily 
5 and efficiently. Previous efforsts in parallel PCR (Nagai, H., el ai, Anal. Chenu 2001, 
73, 1043-1047) required evaporation of the liquid reaction mixture, leaving the amplicon 
dried to the walls of the microreactor, after which it could be recovered for further 
manipulations. The methodology disclosed herein avoids the problems of product 
recovery by including solid phase amplification, immobilizing the PCR product to a 

10 DNA capture bead. Thus, the product of a PicoTiterPlate™ microwell reaction is not 
370,000 wells containing solution-phase PCR product, but up to 370,000 beads bound 
with immobilized PCR product. These PCR products are suitable for numerous solid- 
phase methods of nucleic acid interrogation including the potential capacity to support a 
massively parallel approach to sequencing whole genomes containing up to hundreds of 

1 5 millions of bases. The simplicity of the disclosed method would drastically reduce costs 
for sequencing and other applications now requiring robotics to maintain large-scale 
cloning and PCR. 

The disclosures of one or more embodiments of the invention are set forth in the 
accompanying description. Although any methods and materials similar or equivalent to 

20 those described herein can be used in the practice or testing of the present invention, the 
preferred methods and materials are now described. Other features, objects, and 
advantages of the invention will be apparent from the description and from the claims. In 
the specification and the appended claims, the singular forms include plural referents 
unless the context clearly dictates otherwise. Unless defined otherwise, ail technical and 

25 scientific terms used herein have the same meaning as commonly understood by one of 
ordinary skill in the art to which this invention belongs. Unless expressly stated 
otherwise, the techniques employed or contemplated herein are standard methodologies 
well known to one of ordinary skill in the art. The examples of embodiments are for 
illustration purposes only. All patents and publications cited in this specification are 

30 incorporated by reference. 
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Example 28: Rig Sequencing Method 

Step I: Preparation of pAdEasy PCR DNA Beads 

This proceedure was used for a 384-well plate PCR of Adenovirus clones. 

5 Streptavidin-Sepharose beads (12 mis) were prepared for binding PCR fragments by 
washing once with 2 M NaCl solution and resuspending in 288 mis of 2 M NaCl. The 
washed beads were transferred to fifteen 96-well plates at 200 pi of bead suspension/well. 
The PCR products (25 |il) were transferred to a 384-deep-well plate using a Tecan TeMo 
robot. To bind DNA to solid supports, 25 pi of bead suspension (15,000 beads) were 

10 added to each well of every 384-deep well plate using a Tecan TeMo robot and mixed. 
The final concentration of NaCl in the binding reaction was 1 M. The binding reaction 
was incubated with shaking at room temperature for 3 hr on a shaker. The contents of the 
microtiter plates were pooled by inverting the 384-well plates on to a reservoir and 
centrifuging at lOOOg in a Beckman Allegra benchtop centrifuge. The pooled beads were 

15 transferred into a 50 ml Falcon tube, centrifuged at 1 OOOg, and the supernatant was 
removed. 

Approximately a million beads (mobile solid support) were washed once with 100 
pi of 2 M NaCl followed by two washes with distilled water (100 pi each). The washed 
beads were incubated in 300 pi melting reagent (0.1 M NaCl and 0.125 M NaOH) for 10 
20 minutes in a rotator to remove the non-biolinylated DNA strand. The tube was 
centrifuged at maximum speed to pellet the beads and the melt solution was removed and 
discarded. The beads were washed with 100 pi of melt solution followed by three more 
washes with IX Annealing buffer. After the washes, the beads were resuspended in 25 pi 
of IX Annealing buffer. 

25 Primer P2 (500 pmoles) was added to the bead mixture and mixed. The bead 

mixture, in tubes, was placed into an automated incubator (PCR thermocycler in this 
case) with the following temperature profiles: incubation at 60°C for 5 minutes, decrease 
0.1 o C/second down to 50°C incubation at 50°C for 5 minutes, decrease 0.1°C/second 
down to 40°C, incubation at 40°C for 5 minutes, decrease 0.1°C/second down to 4°C, 

30 incubation at 4°C forever. 
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After annealing, the beads were washed carefully and resuspended in 200 \i\ of 
Bst DNA polymerase binding solution. Then, 10 \x\ aliquots (50.000 beads) of the bead 
suspension were processed for sequencing on the instrument described below. 



5 Step 2: Preparation of Control DNA Beads 

Six control DNA sequences TF 2, 7, 9, 10, 12 and 15 were cloned into 
pBluescript II KS + vector and plasmid DNA was used as template for PCR with one 
biotinylated primer for solid-phase immobilization of the amplicons. 

The following reagents were added to a 1.7 ml lube to create a PCR mix. 

10 



I0XHIFI buffer 


100 ul 


lOmMdNTP mix 


100 ul 


50 niM MgS0 4 


60 ul 


5'-Bio-3HEG-MMPlB 


10 ul 


MMP1A 


10 ul 


H1FI Taq Polymerase 


10 ul 


Mol. Bio. Grade Water 


690 ul I 



Twenty microliters of plasmid template DNA was added and the mix was 
aliquoted by 50 \i\ into 0.2 ml PCR tubes. The following program was used for 
thermocycling: incubation at 94°C for 4 minutes; 39 cycles of incubation at 94°C for 15 
15 seconds, 58°C for 30 seconds. 68°C for 90 seconds, and 68°C for 120 seconds; hold at 
10°C. 

Amplified DNA for each test fragment was purified using the Qiagen MinElute 
PCR Clean-Up Kit as per manufacturer's instructions. The purity and yield of each of the 
test fragments DNA was assessed using the Agilent 2100 Bioanalyzer and DNA 500 
20 reagent kit and chip. Biotinylated PCR products were immobilized onto Sepharose 
streptavidin beads at 10 million DNA copies/bead. 

Beads were washed once with 2 M NaCI solution. This was done by adding 100 
III, vortexing briefly to resuspend the beads, centrifiiging for 1 minute at maximum speed 
to pellet the beads, and then removing the supernatant. This was followed by a second 
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wash with 2 M NaCL The beads were then resuspended in 30 pi of 2 M NaCL PCR 
product was added to beads. The mixture was vortexed to resuspend the beads in 
solution and then placed in a rack, on a liter plate shaker, at speed 7, for I hour at room 
temperature. 

The non-biotinylated second strand was removed by incubation with the alkaline 
melt solution (0.1 M NaOH / 0.15 M NaCl) for 10 minutes in an overhead rotator at room 
temperature. This was followed by washing the beads once with 100 (il of melt solution 
and three times with 100 jil of IX annealing buffer (50 mM Tris-Acetate, pH 7.5; 5 mM 
MgCb). Sequencing primer was annealed to the immobilized single-stranded DNA by 
centrifugation for one minute at maximum speed. The supernatant was removed and the 
beads were resuspended in 25 pi of IX annealing buffer. Next, 5 pi of sequencing primer 
MMP7A (100 pmol/jil) was added to the bead suspension and the following temperature 
profile was used to hybridize the sequencing primer: 

• Incubation at 60°C for 5 minutes; 

• Decrease 0. 1 °C/second down to 50°C; 

• Incubation 50°C for 5 minutes; 

• Decrease 0. 1 °C/second down to 40°C: 

• Incubation at 40°C for 5 minutes; 

• Decrease 0. 1 °C/second down to 4°C; and 

• Hold at 4°C. 

Beads were washed twice with 100 pi of IX annealing buffer and then 
resuspended to a final volume of 200 p.1 with IX annealing buffer and stored in 10 pi 
aliquots in labeled tube strips in a 4°C refrigerator. 

Step 3: Sequencing Chemistry 

Sepharose beads with immobilized single stranded DNA templates and annealed 
sequencing primer were incubated with E. coli single strand binding protein (Amersham 
Biosciences) (5 pi of 2.5 pg/pl ssb stock solution per 50 ? 000 beads) and 500 U (10 pi of 
50U/jli1) of Bsl DNA polymerase (NEB) in 200 pi of Bst polymerase binding solution (25 
mM Tricine pH 7.8; 5 mM magnesium acetate; 1 mM DTT; 0.4 mg/ml PVP MW 
360,000) for 30 minutes at room temperature on a rotator. After this, the DNA beads 
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were mixed with the SL beads and deposited into the wells of the PicoTiter Plate as 
follows. Reagents required for a sequencing run on a 454 instrument included 1) 
substrate wash solution; 2) apyrase containing wash solution; 3) 100 nM inorganic 
pyrophosphate calibration standard; 4) individual nucleotide triphosphate solutions. 

5 All solutions were prepared in the sulfurylase-luciferase assay buffer with 

enzyme substrates (25 mM Tricine pH 7.8; 5 mM magnesium acetate; 0.4 mg/ml PVP 
MW 360,000; 0.01% Tween 20; 300 jaM D-luciferin; 4 |iM APS). The substrate wash 
solution was identical to the luciferase assay buffer. The apyrase containing wash 
solution was based on the luciferase assay buffer, except no enzyme substrates (APS and 

10 D-luciferin) were added and this wash contained apyrase (Sigma St. Lous, MO; 
Pyrosequencing AB, Pyrosequencing, Inc. Westborough, MA) in the final concentration 
of8.5U/l. 

Sodium pyrophosphate (PPi) standard was prepared by adding sodium 
pyrophosphate tetrabasic decahydrate (Sigma, St. Louis, MO) to the luciferase assay 

15 buffer to a final concentration 100 nM. Nucleotide triphosphates (dCTP, dGTP, TTP; 
minimum diphosphate grade) (Amershani Biosciences AB, Uppsala, Sweden) were 
diluted to final concentration of 6.5 jjM in the luciferase assay buffer. Deoxyadenosine 
triphosphate analog, 2 *-Dcoxyadenosine-5'-0-(l- thiotriphosphate), Sp-isomer (Sp- 
dATP-cc-S, Biolog Life Science Institute, Bremen, Germany) was diluted to final 

20 concentration of 50 nM in the luciferase assay buffer. 

Step 4: Cloning His6-BCCP-sulfurylase and His6-BCCP-Iuciferase 

Bacillus stearothermophihis (Bsi) ATP sulflirylase (E.C. 2.7.7.4) and firefly 
{Photinus pyralis) luciferase (E.C. 1.13.12.7) were cloned into Nhe 1-BamH I digested 

25 pRSET-A vector (Invitrogen). The coding sequence of the BCCP (biotin carboxyl carrier 
protein) gene (AlixJ.H., DNA 8 (10), 779-789 (1989); Muramatsu, S. and Mizuno, T., 
Nucleic Acids Res. 17 (10), 3982 (1989), Jackowski, S. and Aiix, J.H.. J. Bacteriol. 172 
(7), 3842-3848 (1990); Li. SJ. and Cronan, J.E. Jr., J. Biol. Chem. 267 (2), 855-863 
(1992), Genbank accession number M80458) was used to design PCR primers to amplify 

30 the fragment corresponding to amino acids 87-165 of the BCCP protein. The forward 
primer was 5'-ctagctagcatggaagcgccagcagca-3'; SEQ ID NO:42 and the reverse primer 
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was 5*-ccgggatccctcgatgacgaccagcggc-3 *, SEQ ID NO:43. The PCR cocktail was 
prepared as Mix I and Mix 2, 25 )il each. Mix 1 included 75 pmoles of primers, 100 ng 
of E. coli genomic DNA and 5 pmoles of dNTPs. Mix 2 included 1 unit of Fidelity 
Expand DNA polymerase (Boehringer Mannheim/Roche Diagnostics Corporation, 
5 Indianapolis. IN. Cat. No. 1 732 641) and 5 |il of 10X Fidelity Expand buffer (Boehringer 
Mannheim/Roche Diagnostics Corporation, Indianapolis, IN). To allow PCR hot-start, 
Mix 1 and Mix 2 were heated separately for 20 seconds at 96°C before they were pooled. 
The pooled reaction was cycled as follows: incubation at 96°C for 3 min, 10 cycles of 
incubation at 96°C for 30 sec, 55°C for 1 min, and 68°C for 2 min, then 20 cycles of 
10 incubation at 96°C for 30s ec, 60°C for 1 min, and 68°C for 2 min, followed by a 
polishing step of incubation at 72°C for 7 min. After PCR, a single 250 bp fragment was 
obtained. The BCCP fragment was digested with Nhe I and BamH 1 and subcloned into 
Mlie 1-BamH I digested pRSET-A. 

15 Step 5: Expression of Sulfurylase and Luciferase. 

The Bst ATP sulfurylase and P. pyralis luciferase open reading frames were 
amplified by PCR with primers that contain Pst VHind III and BamH VXIio 1 sites (the 
first enzyme was at the 5* end and the second enzyme was at the 3 ? end), respectively. 
This produced an N-terminal fusion of 6X His and BCCP domain to ATP sulfurylase and 

20 luciferase. The enzymes were expressed in E. coli using biotin-supplcmenled growth 
media to allow for in-vivo biotinylation via the BCCP domain. The enzymes were 
purified to near homogeneity using a combination of IMAC and a size-exclusion column 
chromatography. Purification was assessed by electrophoresis using the Agilent 2100 
Bioanalyzer on Protein 200 Plus chips. 

25 

Step 6: Solid Phase Immobilization of Luciferase and Sulfurylase 

The enzymes were immobilized onto Dynal M-280 streptavidin coated magnetic 
microparticles (Dynal, Oslo, Norway) and Bangs microspheres (300 nm) by incubation of 
1:3 mixture of ATP sulfurylase and luciferase, respectively. The binding was performed 
30 by mixing 50 \xg of ATP sulfurylase and 150 \ig of luciferase with 1 mg of Dynal M-280 
beads or 0.6 mg of Bangs microspheres in TAGE buffer (25 mM Tris-Acetate pH 7.8, 
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200 mM ammonium sulfate, 15% v/v glycerol and 30% v/v ethylene glycol). The 
mixture was incubated for 1 hour at 4°C on a rotator. After binding, the beads could be 
stored at -20 °C in the enzyme solution for 3 months. Before use, beads were washed 
thoroughly in luciferase assay buffer containing 0.1 mg/ml bovine serum albumin 
5 (Sigma* St Louis, MO). Immobilized enzyme activity was assayed using a luminometer 
(Turner, Sunnyvale, California). Washed beads were stored on ice until deposition onto a 
PTP slide. 



Step 7: PicoTiterPlates™ (PTPs) 
10 The PicoTiterPlates™ (25 x 75 x 2 mm) were manufactured by anisotropic etching 

of fiber optic face plates in a manner similar to that described in literature. Plates were 
etched in three different microwell depths, 26, 50 and 76 mm. Microwell center-to-center 
pitch was 50 pm, and well diameters ranged between 39 and 44 urn with a calculated well 
density of 480 wells/mm 2 . 

15 

Step 8: PTP Loading 

Scpharose beads carrying DNA templates and Dynal M-280 / Bangs 0.3 \im bead 
mixture with immobilized sulfurylase and luciferase enzymes were deposited into 
individual wells of a PicoTiter plate using a centrifugation-based method. The procedure 

20 employed an in-house polycarbonate fixture (jig) which included a bottom plate (with 
slide positioning pegs), an elastomer sealing gasket, and a top plate with two loading 
ports. The PTP slide was placed onto the bottom plate with the etched side facing up and 
the top plate with sealing gasket in place was clamped on top of the PTP slide. The 
whole assembly was tightened with four plastic screws in order to provide a water-tight 

25 seal. The sealing gasket was designed to form a mask for bead deposition, resulting in 
one hexagonal area (14 x 43 mm) covering roughly 270,000 PTP wells. 

Beads were deposited in ordered layers. The PTP was removed from incubating 
in Bead Wash Buffer. Layer 1, a mix of DNA and enzyme beads, was deposited. After 
centrifuging, Layer I supernatant was aspirated off the PTP and Layer 2, Dynal enzyme 

30 beads, was deposited. 
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A bead suspension was prepared by mixing 150,000 DNA carrying Sepharose 

beads in 120 |il of the ssb/Bst pol binding mix (see above) with 270 |il of Dynal-SL and 

Bangs-SL beads (both at 10 nig/ml) in a total volume of 500 \x\ of the luciferase assay 

buffer containing 0.1 mg/ml bovine serum albumin. The bead slurry was vortexed and 

5 flowed into the bead deposition jig through pipetting ports. Care was taken to aviod 
introducing air bubbles. The jig/PTP assembly was centrifuged at 2000 rpm for 8 
minutes in a Beckman Allegra 6 centrifuge equipped with a 4-position plate swing-out 
rotor. After centrifugation, the supernatant was carefully removed from the jig chamber 
using a pipette. A second layer of only Dynal-SL beads was deposited. This layer 

10 included 125 \x\ of Dynal-SL (at 10 mg/ml) and 375 ^1 Bead Wash Buffer in a 1.5 ml 
tube (2.5 mg/ml Dynal beads). The Dynal bead mixture was pipetted into the PTP main 
active area and centrifuged for 8 minutes at 2000 rpm. Layer 2 mixture was aspirated 
and the PTP was placed back into Bead Wash Buffer (luciferase assay buffer with 0.1 
mg/ml bovine serum albumin and 8.5 U/I apyrase) until ready to load onto the Sequencer. 

15 

Step 9: Sequencing Instrument 

The in-house sequencing instrument included three major assemblies: a fluidics 
subsystem, a PTP cartridge/flow chamber, and an imaging subsystem. The fluidics 
subsystem included reagent reservoirs, reagents inlet lines, a multi-valve manifold, and a 

20 peristaltic pump. It allowed for reagent delivery into the flow chamber, one reagent at a 
time, at a pre-programmed flow rate and duration. The PTP cartridge/flow chamber was 
designed in such a way that after attaching a PTP, there would be 300 urn space between 
the PTP top (etched side) and the chamber ceiling. It included means for temperature 
control of the reagents and PTP, as well as a light-tight housing. The polished side of a 

25 PTP was exposed at the back side of the PTP cartridge and was placed directly in contact 
with the imaging system. The imaging system comprised a CCD camera with a 1-1 
imaging fiber bundle, as well as cryogenic cooling system for the camera, and camera 
control electronics. The camera used was a Spectral Instruments (Tucson, AZ) series 600 
camera with a Fairchild Imaging LM485 CCD (16 million pixels, 15 yxiw pixel size). This 

30 was bonded directly to the imaging fiber bundle with 6 |im fiber pitch. The camera was 
cooled to -70° C and operated in a frame transfer mode, hi this way, the center portion of 
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the CCD was used for imaging while the outer portion of the CCD was used for image 
storage and read-out. The read-out occurred through 4 ports at each corner of the CCD. 
The data acquisition rate was set to 1 frame per 30 seconds. The frame-transfer shift time 
was approximately 0.25 seconds. All camera images were stored in a UTIFF 16 format 
5 on a computer hard drive (IBM eServer xSeries 335, IBM, White Plains, NY). 

Step 10: Sequencing Run Conditions 

The cyclical delivery of sequencing reagents into the PTP wells and washing of 
the sequencing reaction byproducts from the wells was achieved by a pre-programmed 

10 operation of the fluidics system. The program was written in a form of a Microsoft Excel 
script, specifying the reagent name (Wash, dATPaS, dCTP, dGTP, dTTP, PP; standard), 
flow rate and duration of each script step. Flow rate was set at 3 ml/min for all reagents 
and the linear velocity within the flow chamber was approximately. An initial wash step 
(5 minutes) was followed by a PP, standard flow (2 min), followed by 21 or 42 cycles of 

15 (Wash - C - Wash - A - Wash - G - Wash - T ), where each nucleotide flow was 0.5 
minute and wash steps were 2 minutes. After all cycles of nucleotide additions and 
washes, a second PP; standard flow (2 min) was delivered, followed by a final 5 minutes 
wash step. The total run time was 4 hours. Reagent volumes required to complete this 
run script were as follows: 300 ml each wash solution, 50 ml of each nucleotide solution, 

20 20 ml of PPi standard solution. During the run, all reagents were kept at room 
temperature. Because the flow chamber and flow chamber inlet tubing were maintained 
at 3 °C, all reagents entering the flow chamber were at 30°C. 
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CLAIMS 

WE CLAIM: 

1 . A method for sequencing nucleic acids comprising: 

(a) fragmenting large template nucleic acid molecules to generate a plurality 
of fragmented nucleic acids; 

(b) delivering the fragmented nucleic acids into aqueous microreactors in a 
water-in-oil emulsion such that a plurality of aqueous microreactors comprise a 
single copy of a fragmented nucleic acid, a single bead capable of binding to the 
fragmented nucleic acid, and amplification reaction solution containing reagents 
necessary to perform nucleic acid amplification; 

(c) amplifying the fragmented nucleic acids in the microreactors to form 
amplified copies of said nucleic acids and binding the amplified copies to beads in 
the microreactors; 

(d) delivering the beads to an array of at least 1 0,000 reaction chambers on a 
planar surface, wherein a plurality of the reaction chambers comprise no more than a 
single bead; and 

(e) performing a sequencing reaction simultaneously on a plurality of the 
reaction chambers. 

2. The method of claim 1 wherein the reaction chambers have a center to center 
spacing of 20 to 100 |im. 

3. The method of claim 1 wherein the fragmented nucleic acids are 30 - 500 bases. 

4. The method of claim 1 wherein a plurality of the beads bind at least 10,000 
amplified copies. 

5. The method of claim 1 wherein step (c) is accomplished using polymerase chain 
reaction. 

6. The method of claim 1 wherein the sequencing reaction is a pyrophosphate-based 
sequencing reaction. 

7. The method of claim 1 wherein the sequencing reaction comprises the steps of: 
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(a) annealing an effective amount of a sequencing primer to the 
amplified copies of the nucleic acid and extending the sequencing primer 
with a polymerase and a predetermined nucleotide triphosphate to yield a 
sequencing product and, if the predetermined nucleotide triphosphate is 
incorporated onto a 3' end of said sequencing primer, a sequencing reaction 
byproduct; and 

(b) identifying the sequencing reaction byproduct, thereby determining 
the sequence of the nucleic acid in a plurality of the reaction chambers. 

8. The method of claim 1 wherein the sequencing reaction comprises the steps of: 

(a) hybridizing two or more sequencing primers to one or a plurality of single 
strands of the nucleic acid molecule wherein all the primers except for one are 
reversibly blocked primers; 

(b) incorporating at least one base onto the nucleic acid molecule by 
polymerase elongation from an unblocked primer; 

(c) preventing further elongation of said unblocked primer; 

(d) deblocking one of the reversibly blocked primers into an unblocked 
primer; and 

(e) repeating steps (b) to (d) until at least one of the reversibly 
blocked primers are deblocked and used for determining a sequence. 

9. The method of claim 1 wherein the reaction chambers are cavities formed by 
etching one end of a fiber optic bundle. 

10. An array comprising a planar surface with a plurality of cavities thereon, each 
cavity forming an analyte reaction chamber, wherein the reaction chambers have a 
center to center spacing of between 20 to 100 (im and each cavity has a width in at 
least one dimension of between 20 jim and 70 (im, and wherein there are at least 
1 0,000 reaction chambers. 

11. The array of claim 10 wherein a plurality of reaction chambers contains at least 
100,000 copies of a single species of single stranded nucleic acid template. 
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12. The array of claim 1 1 wherein the single stranded nucleic acid templates are 
immobilized on mobile solid supports disposed in the reaction chambers. 

13. The array of claim 10 wherein the center to center spacing is between 40 to 
60 \im. 

14. The array of claim 10 wherein each cavity has a depth of between 20 urn and 
60 jam. 

15. An array comprising a planar top surface and a planar bottom surface wherein the 
planar top surface has at least 10,000 cavities thereon, each cavity forming an 
analyte reaction chamber, the planar bottom surface is optically conductive such 
that optical signals from the reaction chambers can be detected through the 
bottom planar surface, wherein the distance between the top surface and the 
bottom surface is no greater than 5 mm, wherein the reaction chambers have a 
center to center spacing of between 20 to lOO^m and each chamber having a 
width in at least one dimension of between 20|im and 70(am. 

16. The array of claim 15 wherein the distance between the top surface and the 
bottom surface is no greater than 2 mm. 

17. The array of claim 10 or 1 5 wherein the number of cavities is greater than 50,000. 

18. The array of claim 10 or 15 wherein the number of cavities is greater than 
100,000. 

19. The array of claim 10 or 15 wherein the shape of each reaction chamber is 
substantially hexagonal. 

20. The array of claim 10 or 15 wherein each cavity has at least one irregular wall 
surface. 

21. The array of claim 10 or 15 wherein the array is formed in a fused fiber optic 
bundle. 

22. The array of claim 10 or 15 wherein each cavity has a smooth wall surface. 

23. The array of claim 10 or 15 wherein the cavities are formed by etching one end of 
the fiber optic bundle. 
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24. The array of claim 10 or 15 wherein each cavity contains reagents for analyzing a 
nucleic acid or protein. 

25. The array of claim 1 0 or 1 5 further comprising a second surface spaced apart from 
the planar array and in opposing contact therewith such that a flow chamber is 
formed over the array. 

26. An array means for carrying out separate parallel common reactions in an aqueous 
environment, wherein the array means comprises a substrate comprising at least 
10,000 discrete reaction chambers containing a starting material that is capable of 
reacting with a reagent, each of the reaction chambers being dimensioned such 
that when one or more fluids containing at least one reagent is delivered into each 
reaction chamber, the diffusion time for the reagent to diffuse out of the well 
exceeds the time required for the starting material to react with the reagent to 
form a product. 

27. The array of claim 26 wherein each cavity contains reagents for analyzing a 
nucleic acid or protein. 

28. The array of claim 26 further comprising a population of mobile solid supports 
disposed in the reaction chambers, each mobile solid support having one or more 
bioactive agents attached thereto. 

29. The array of claim 26 wherein the cavities are formed in the substrate via etching, 
molding or micromaching. 

30. The array of claim 17 wherein the substrate is a fiber optic bundle. 

31. The array of claims 10, 15 or 26 wherein at least 5% to 20% of the reaction 
chambers contain at least one mobile solid support having at least one reagent 
immobilized thereon. 

32. The array of claims 10, 15 or 26 wherein at least 20% to 60% of the reaction 
chambers have at least one mobile solid support having at least one reagent 
immobilized thereon. 
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33. The array of claims 10, 15 or 26 wherein at least 50% to 100% of the reaction 
chambers have at least one mobile solid support having at least one reagent 
immobilized thereon. 

34. The array of claim 31 wherein the reagent immobilized on the mobile solid 
support is a polypeptide with sulfurylase activity. 

35. The array of claims 31 wherein the reagent immobilized on the mobile solid 
support is a polypeptide with luciferase activity. 

36. The array of claims 31 wherein the mobile solid support has both sulfurylase and 
luciferase immobilized. 

37. The array of claim 31 wherein a plurality of reaction chambers contains at least 
100,000 copies of a single species of single stranded nucleic acid template. 

38. The array of claim 31 wherein the single stranded nucleic acid templates are 
immobilized on mobile solid supports disposed in the reaction chambers. 

39. The array of claim 10, 15 or 26 wherein the nucleic acid is suitable for use in a 
pyrosequencing reaction. 

40. A method for delivering a bioactive agent to an array, comprising dispersing over 
the array a plurality of mobile solid supports, each mobile solid support having at 
least one reagent immobilized thereon, wherein the reagent is suitable for use in a 
nucleic acid sequencing reaction, where the array comprises a planar surface with 
a plurality of reaction chambers disposed thereon, wherein the reaction chambers 
have a center to center spacing of between 20 to 100 \im and each reaction 
chamber has a width in at least one dimension of between 20 Jim and 70 pm. 

41. An apparatus for simultaneously monitoring an array of reaction chambers for 
light indicating that a reaction is taking place at a particular site, the apparatus 
comprising: 

(a) an array of reaction chambers formed from a planar substrate comprising a 
plurality of cavitaled surfaces, each cavitaled surface forming a reaction chamber adapted 
to contain analytes, and wherein the reaction chambers have a center to center spacing of 
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between 20 lo 100 jini, each reaction chamber having a volume of between 10 to 150 pL, 
the array comprising more than 1 0,000 discrete reaction chambers; 

(b) an optically sensitive device arranged so that in use the light from a 
particular reaction chamber will impinge upon a particular predetermined region of said 
optically sensitive device; 

(c) means for determining the light level impinging upon each of said 
predetermined regions and 

(d) means to record the variation of said light level with time for each of said 
reaction chamber. 

42. An analytic sensor, comprising: 

(a) an array formed from a first bundle of optical fibers with a plurality of 
cavitated surfaces at one end thereof, each cavitated surface forming a reaction chamber 
adapted to contain analytes, and wherein the reaction chambers have a center to center 
spacing of between 20 to 100 jxm, a width of 20 to 70 jam, the array comprising more 
than 10,000 discrete reaction chambers; 

(b) an enzymatic or fluorescent means for generating light in the reaction 
chambers; 

(c) light detection means comprising a light capture means and a second fiber 
optic bundle for transmitting light to the light detecting means, the second fiber optic 
bundle being in optical contact with the array, such that light generated in an individual 
reaction chamber is captured by a separate fiber or groups of separate fibers of the second 
fiberoptic bundle for transmission to the light capture means. 

43. The sensor of claim 42 wherein said sensor is suitable for use in a biochemical 
assay. 

44. The sensor of claim 42 wherein said sensor is suitable for use in a cell-based 
assay. 

45. The sensor of claim 42 wherein the light capture means is a CCD camera. 

46. The sensor of claim 42 wherein the reaction chambers contain one or more mobile 
solid supports with a bioactive agent immobilized thereon. 
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47. A method for carrying out separate parallel common reactions in an aqueous 
environment, comprising: 

(a) delivering a fluid containing at least one reagent to an array, wherein the 
array comprises a substrate comprising at least 10,000 discrete reaction chambers, each 
reaction chamber adapted to contain analytcs, and wherein the reaction chambers have a 
volume of between 10 to 150 pL and containing a starting material that is capable of 
reacting with the reagent, each of the reaction chambers being dimensioned such that 
when the fluid is delivered into each reaction chamber, the diffusion time for the reagent 
to diffuse out of the well exceeds the time required for the starting material to react with 
the reagent to form a product; and 

(b) washing the fluid from the array in the time period (i) after the starting 
material has reacted with the reagent to form a product in each reaction chamber but 
(ii) before the reagent delivered to any one reaction chamber has diffused out of that 
reaction chamber into any other reaction chamber. 

48. The method of claim 47 wherein the product formed in any one reaction chamber 
is independent of the product formed in any other reaction chamber, but is 
generated using one or more common reagents 

49. The method of claim 47 wherein the starting material is a nucleic acid sequence 
and at least one reagent in the fluid is a nucleotide or nucleotide analog 

50. The method of claim 47 wherein the fluid additionally comprises a polymerase 
capable of reacting the nucleic acid sequence and the nucleotide or nucleotide 
analog 

51. The method of claim 47 additionally comprising repeating steps (a) and 
(b) sequentially. 

52. A method for delivering nucleic acid sequencing enzymes to an array, said array 
having a planar surface with a plurality of cavities thereon, each cavity forming an 
analyte reaction chamber, wherein the reaction chambers have a center to center 
spacing of between 20 to 100 ^m; the method comprising dispersing over the 
array a plurality of mobile solid supports having one or more nucleic acid 
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sequencing enzymes immobilized thereon, such that a plurality of the reaction 
chambers contain at least one mobile solid support. 

53. The method of claim 52 wherein one of the nucleic acid sequencing enzymes is a 
polypeptide having sulfurylase activity, luciferase activity or both. 

54. A method for delivering a plurality of nucleic acid templates to an array, said 
array having a planar surface with a plurality of cavities thereon, each cavity 
forming an analyte reaction chamber, wherein the reaction chambers have a center 
to center spacing of between 20 to 100 \im and the array having at least 10,000 
reaction chambers; the method comprising dispersing over the array a plurality of 
mobile solid supports, each mobile solid support having no more than a single 
species of nucleic acid template immobilized thereon, the dispersion causing no 
more than one mobile solid support to be disposed within any one reaction 
chamber. 

55. The method of claim 54 wherein the nucleic add sequence is a single stranded 
nucleic acid. 

56. The method of claim 54 wherein at least 100,000 copies of a single species of 
nucleic acid template are immobilized on a plurality of the mobile solid supports. 

57. The method of claim 54 wherein each single species of nucleic acid template is 
amplified on a picotiter plate to produce at least 2,000,000 copies per well of said 
nucleic acid template after being disposed in the reaction chamber. 

58. The method of claim 57 wherein the nucleic acid sequence is amplified using an 
amplification technology selected from the group consisting of polymerase chain 
reaction, ligase chain reaction and isothermal DNA amplification. 

59. A method for sequencing a nucleic acid, the method comprising: 

(a) providing a plurality of single-stranded nucleic acid templates disposed 
within a plurality of cavities on a planar surface, each cavity forming an analyte reaction 
chamber, wherein the reaction chambers have a center to center spacing of between 20 to 
100 \im and the planar surface has at least 10,000 reaction chambers; 
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(b) performing a pyrophosphate based sequencing reaction simultaneously on 
all reaction chambers by annealing an effective amount of a sequencing primer to the 
nucleic acid templates and extending the sequencing primer with a polymerase and a 
predetermined nucleotide triphosphate to yield a sequencing product and, if the 
predetermined nucleotide triphosphate is incorporated onto the 3' end of said sequencing 
primer, a sequencing reaction byproduct; and 

(c) identifying the sequencing reaction byproduct, thereby determining the 
sequence of the nucleic acid in each reaction chamber. 

60. The method of claim 59, wherein the sequencing reaction byproduct is PPi and a 
coupled sulftirylase/luciferase reaction is used to generate light for detection. 

61. The method of claim 60, wherein either or both of the sulfurylase and luciferase 
are immobilized on one or more mobile solid supports disposed at each reaction 
site. 

62. A method of determining the base sequence of a plurality of nucleotides on an 
array, the method comprising: 

(a) providing at least 1 0,000 DNA templates, each separately disposed within 
a plurality of cavities on a planar surface, each cavity forming an analyte reaction 
chamber, wherein the reaction chambers have a center to center spacing of between 20 to 
100 ^im, and a volume of between 10 to 150 pL; wherein 

(b) adding an activated nucleotide S'-triphosphate precursor of one known 
nitrogenous base to a reaction mixture in each reaction chamber, each reaction mixture 
comprising a template-directed nucleotide polymerase and a single-stranded 
polynucleotide template hybridized to a complementary oligonucleotide primer strand at 
least one nucleotide residue shorter than the templates to form at least one unpaired 
nucleotide residue in each template at the 3 '-end of the primer strand, under reaction 
conditions which allow incorporation of the activated nucleoside 5 '-triphosphate 
precursor onto the 3"-end of the primer strands, provided the nitrogenous base of the 
activated nucleoside 5 '-triphosphate precursor is complementary to the nitrogenous base 
of the unpaired nucleotide residue of the templates; 
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(c) detecting whether or not the nucleoside S'-triphosphate precursor was 
incorporated into the primer strands in which incorporation of the nucleoside 5'- 
triphosphate precursor indicates that the unpaired nucleotide residue of the template has a 
nitrogenous base composition that is complementary to that of the incorporated 
nucleoside 5 '-triphosphate precursor; and 

(d) sequentially repeating steps (b) and (c), wherein each sequential repetition 
adds and, detects the incorporation of one type of activated nucleoside 5'-triphosphate 
precursor of known nitrogenous base composition; and 

(e) determining the base sequence of the unpaired nucleotide residues of the 
template in each reaction chamber from the sequence of incorporation of said nucleoside 
precursors. 

63. A method of identifying the base in a target position in a DNA sequence of 
template DNA, wherein: 

(a) at least 10,000 separate DNA templates are separately disposed within a 
plurality of cavities on a planar surface, each cavity forming an analyte reaction chamber, 
wherein the reaction chambers have a center to center spacing of between 20 to 100 ^im ? 
said DNA being rendered single stranded either before or after being disposed in the 
reaction chambers, 

(b) an extension primer is provided which hybridizes to said immobilized 
single-stranded DNA at a position immediately adjacent to said target position; 

(c) said immobilized single-stranded DNA is subjected to a polymerase 
reaction in the presence of a predetermined deoxynucleotide or dideoxynucleotide, 
wherein if the predetermined deoxynucleotide or dideoxynucleotide is incorporated onto 
the 3' end of said sequencing primer then a sequencing reaction byproduct is formed; and 

(d) identifying the sequencing reaction byproduct, thereby determining the 
nucleotide complementary to the base at said target position in each of the 10,000 DNA 
templates. 

64. The method of claim 63 wherein in place of deoxy- or dideoxy adenosine 
triphosphate (ATP) a dATP or ddATP analogue is used which is capable of acting 
as a substrate for a polymerase but incapable of acting as a substrate for said 
PPi — detection enzyme. 
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65. An apparatus for analyzing a nucleic acid sequence, the apparatus comprising: 

(a) a reagent delivery cuvette, wherein the cuvette includes an array 
comprising a planar surface with a plurality of cavities thereon, each cavity forming an 
analyte reaction chamber, wherein the reaction chambers have a center to center spacing 
of between 20 to 100 (am, and there are in excess of 10,000 reaction chambers, and 
wherein the reagent delivery cuvette contains reagents for use in a sequencing reaction: 

(b) a reagent delivery means in communication with the reagent delivery 
cuvette; 

(c) an imaging system in communication with the reagent delivery chamber; 

and 

(d) a data collection system in communication with the imaging system. 

66. An apparatus for determining the base sequence of a plurality of nucleotides on an 
array, the apparatus comprising: 

(a) a reagent cuvette containing a plurality of cavities on a planar surface, 
each cavity forming an analyte reaction chamber, wherein there are in excess of 10,000 
reaction chambers, each having a center to center spacing of between 20 to 100 |im and a 
volume of between 1 0 to 1 50 pL; 

(b) reagent delivery means for simultaneously adding to each reaction 
chamber an activated nucleotide 5 1 - triphosphate precursor of one known nitrogenous 
base to a reaction mixture in each reaction chamber, each reaction mixture comprising a 
template-directed nucleotide polymerase and a single-stranded polynucleotide template 
hybridized to a complementary oligonucleotide primer strand at least one nucleotide 
residue shorter than the templates to form at least one unpaired nucleotide residue in each 
template at the 3'-end of the primer strand, under reaction conditions which allow 
incorporation of the activated nucleoside 5 '-triphosphate precursor onto the 3'-end of the 
primer strands, provided the nitrogenous base of the activated nucleoside 5' -triphosphate 
precursor is complementary to the nitrogenous base of the unpaired nucleotide residue of 
the templates; 

(c) detection means for detecting in each reaction chamber whether or not the 
nucleoside 5 -triphosphate precursor was incorporated into the primer strands in which 
incorporation of the nucleoside 5' -triphosphate precursor indicates that the unpaired 



172 



Express Mail: EV 399526138 US 21465-510 UTIL 

Mailed January 28, 2004 

nucleotide residue of the template has a nitrogenous base composition that is 
complementary to that of the incorporated nucleoside 5'-triphosphale precursor; and 

(d) means for sequentially repeating steps (b) and (c), wherein each sequential 
repetition adds and, detects the incorporation of one type of activated nucleoside 5 - 
triphosphate precursor of known nitrogenous base composition; and 

(e) data processing means for determining the base sequence of the unpaired 
nucleotide residues of the template simultaneously in each reaction chamber from the 
sequence of incorporation of said nucleoside precursors. 

67. An apparatus for processing a plurality of analytes, the apparatus comprising: 

(a) a flow chamber having disposed therein a substrate comprising at least 
50,000 cavitated surfaces on a fiber optic bundle, each cavitated surface forming a 
reaction chamber adapted to contain analytes, and wherein the reaction chambers have a 
center to center spacing of between 20 to 100 pm and a diameter of 20 to 70 pm; 

(b) fluid means for delivering processing reagents from one or more reservoirs 
to the flow chamber so that the analytes disposed in the reaction chambers are exposed to 
the reagents; and 

(c) detection means for simultaneously detecting a sequence of optical signals 
from each of the reaction chambers, each optical signal of the sequence being indicative 
of an interaction between a processing reagent and the analyte disposed in the reaction 
chamber, wherein the detection means is in communication with the cavitated surfaces. 

68. The apparatus of claim 67 wherein the detection means is a CCD camera. 

69. The apparatus of claim 67 wherein the analyte is nucleic acid. 

70. The apparatus of claim 67 wherein the analytes are immobilized on one or more 
mobile solid supports that are disposed in the reaction chamber. 

71. The apparatus of claim 67 wherein the processing reagents are immobilized on 
one or more mobile solid supports. 

72. A method for sequencing a nucleic acid, the method comprising: 

(a) providing a plurality of single-stranded nucleic acid templates in an array 
having at least 50,000 discrete reaction sites; 
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(b) contacting the nucleic acid templates with reagents necessary to perform a 
pyrophosphate-based sequencing reaction coupled to light emission; 

(c) detecting the light emitted from a plurality of reaction sites on respective 
portions of an optically sensitive device; 

(d) converting the light impinging upon each of said portions of said optically 
sensitive device into an electrical signal which is distinguishable from the signals from all 
of said other reaction sites; 

(e) determining the sequence of the nucleic acid templates based on light 
emission for each of said discrete reaction sites from the corresponding electrical signal. 

73. The method of claim 1 further comprising the steps of: 

(a) uniquely tagging fragmented nucleic acids from different biological 
sources libraries to create libraries of fragmented nucleic acids with different detectable 
sequence tags; 

(b) sequencing said fragmented nucleic acids and detecting said detectable 
sequence tag from each said tagged nucleic acid fragment. 

74. The method of claim 1 wherein the libraries are delivered individually or wherein 
the libraries are mixed and delivered simultaneously. 

75. The method of claim I wherein said detectable sequence tag comprises an 
oligonucleotide of between 2 and 50 bases. 

76. A method for sequencing nucleic acids comprising: 

(a) fragmenting large template nucleic acid molecules to generate a plurality 
of fragmented nucleic acids; 

(b) attaching one strand of a plurality of the fragmented nucleic acids 
individually to beads to generate single stranded nucleic acids attached 
individually to beads; 

(c) delivering a population of the single stranded fragmented nucleic acids 
attached individually to beads to an array of at least 10,000 reaction 
chambers on a planar surface, wherein a plurality of the wells comprise 
no more than a one bead with on single stranded fragmented nucleic acid; 
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(d) performing a sequencing reaction simultaneously on a plurality of the 
reaction chambers. 

77. The method of claim 76 wherein the reaction chambers have a center to center 
spacing of between 20 to 100 [im. 

78. The method of claim 76 wherein the fragmented nucleic acids are between 30 - 
500 bases. 

79. The method of claim 76 wherein the fragmented nucleic acids are amplified in the 
reaction chambers prior to step (d). 

80. The method of claim 76 wherein the amplifying step is accomplished using 
polymerase chain reaction. 

81. The method of claim 76 wherein the sequencing reaction is a pyrophosphate- 
based sequencing reaction. 

82. The method of claim 76 wherein the sequencing reaction comprises the steps of: 

(f) annealing an effective amount of a sequencing primer to the single stranded 
fragmented nucleic acid templates and extending the sequencing primer 
with a polymerase and a predetermined nucleotide triphosphate to yield a 
sequencing product and, if the predetermined nucleotide triphosphate is 
incorporated onto the 3' end of said sequencing primer, a sequencing 
reaction byproduct; and 

(g) identifying the sequencing reaction byproduct, thereby determining the 
sequence of the nucleic acid in a plurality of the reaction chambers. 

83. The method of claim 76 wherein the sequencing reaction comprises the steps of: 

(a) hybridizing two or more sequencing primers to one or a plurality of single 
strands of the nucleic acid molecule wherein all the primers except for one are 
reversibly blocked primers; 

(b) incorporating at least one base onto the nucleic acid molecule by 
polymerase elongation from an unblocked primer; 

(c) preventing further elongation of said unblocked primer; 
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(d) deblocking one of the reversibly blocked primers into an unblocked 
primer; and 

(e) repeating steps (b) to (d) until at least one of the reversibly blocked 
primers are deblocked and used for determining a sequence. 

84. The method of claim 76 wherein the reaction chambers arc cavities formed by 
etching one end of a fiber optic bundle. 
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Abstract 

An apparatus and method for performing rapid DNA sequencing, such as genomic 
sequencing, is provided herein. The method includes the steps of preparing a sample 
DNA for genomic sequencing, amplifying the prepared DNA in a representative manner, 
and performing multiple sequencing reaction on the amplified DNA with only one primer 
hybridization step. 
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required items and pay any fees required below to avoid abandonment. Extensions of time may be obtained by 
filing a petition accompanied by the extension fee under the provisions of 37 CFR 1 .136(a). 

• The oath or declaration is unsigned. 

• To avoid abandonment, a late filing fee or oath or declaration surcharge as set forth in 37 CFR 1 .16(e) of 
$65 for a small entity in compliance with 37 CFR 1.27, must be submitted with the missing items identified 
in this letter. 

The application is informal since it does not comply with the regulations for the reason(s) indicated below. 
The required item(s) identified below must be timely submitted to avoid abandonment: 

• Replacement drawings in compliance with 37 CFR 1 .84 and 37 CFR 1 .1 21 are required. The drawings 
submitted are not acceptable because: 

■ More than one figure is present and each figure is not labeled "Fig." with a consecutive 
Arabic numeral (1 , 2, etc.) or an Arabic numeral and capital letter in the English 
alphabet (A, B. etc.)(see 37 CFR 1.84(u)(1)). See Figure(s) 8A.8B. 

■ The drawings submitted to the Office are not electronically reproducible. Drawing 
sheets must be submitted on paper, which is flexible, strong, white, smooth, non-shiny, 
and durable (see 37 CFR 1.84(e)). See Figure(s) 3A-3D,9,10A-10F,22,39. 



• This application clearly fails to comply with the requirements of 37 CFR. 1 .821 -1 .825. Applicant's attention 
is directed to the final rulemaking notice published at 55 FR 18230 (May 1 , 1990), and 1 1 14 OG 29 (May 
15. 1990). If the effective filing date is on or after July 1, 1998, see the final rulemaking notice published at 
63 FR 29620 (June 1. 1998) and 121 1 OG 82 (June 23. 1998). If the effective filing date is on or after 
September 8, 2000, see the final rulemaking notice published in the Federal Register at 65 FR 54604 



(September 8. 2000) and 1238 OG 145 (September 19. 2000). Applicant must provide an initial computer 
readable form (CRF) copy of the "Sequence Listing", an initial paper or compact disc copy of the 
"Sequence Listing", as well as an amendment directing its entry into the application. Applicant must also 
provide a statement that the'content of the sequence listing information recorded in computer readable 
form is identical to the written (on paper or compact disc) sequence listing and, where applicable, includes 
no new matter, as required by 37 CFR 1.821(e), 1.821(f), 1.821(g). 1 .825(b), or 1.825(d). If applicant 
desires the sequence listing in the instant application to be identical with that of another application on file 
in the U.S. Patent and Trademark Office, such request in accordance with 37 CFR 1.821(e) may be 
submitted in lieu of a new CRF. 



For questions regarding compliance to these requirements, please contact: 

■ For Rules Interpretation, call (703) 308-4216 

■ To Purchase Patentln Software, call (703) 306-2600 

■ For Patentln Software Program Help, call (703) 306-4119 ore-mail at 
patin21help@uspto.gov or patin3help@uspto.gov 

The following item(s) appear to have been omitted from the application: 
• Figure(s) 11C&11D described in the specification. 

I. Should applicant contend that the above-noted omitted item(s) was in fact deposited in the U.S. Patent and 
Trademark Office (USPTO) with the nonprovisional application papers, a copy of this Notice and a petition (and 
$130.00 petition fee (37 CFR 1.17(h))) with evidence of such deposit must be filed within TWO MONTHS of the 
date of this Notice. The petition fee will be refunded if is determined that the item(s) was received by the USPTO. 

II. Should applicant desire to supply the omitted item(s) and accept the date that such omitted item(s) was filed in 
the USPTO as the filing date of the above-identified application, a copy of this Notice, the omitted item(s) (with a 
supplemental oath or declaration in compliance with 37 CFR 1.63 and 1.64 referring to such items), and a petition 
under 37 CFR 1 .182 (with the $1 30.00 petition fee (37 CFR 1. 17(h)) requesting the later filing date must be filed 
within TWO MONTHS of the date of this Notice. 

Applicant is advised that generally the filing fee required for an application is the filing fee in effect on the filing 
date accorded the application and that payment of the requisite basic filing fee on a date later than the filing date 
of the application requires payment of a surcharge (37 CFR 1.16(e)). To avoid processing delays and payment of 
a surcharge, applicant should submit any balance due for the requisite filing fee based on the later filing date 
being requested when submitting the omitted items(s) and the petition (and petition fee) requesting the later filing 
date. 

III. The failure to file a petition (and petition fee) under the above options (I) or (II) within TWO MONTHS of the 
date of this Notice (37 CFR 1.181 (0) will be treated as a constructive acceptance by the applicant of the 
application as deposited in the USPTO. THIS TWO MONTH PERIOD IS NOT EXTENDABLE UNDER 37 CFR 
1.136(a) or (b). In the absence of a timely filed petition in reply to this Notice, the application will maintain a filing 
date as of the date of deposit of the application papers in the USPTO, and original application papers [i.e. , the 
original disclosure of the invention) will include only those application papers present in the USPTO on the date of 
deposit. 

In the event that applicant elects not to take action pursuant to options (I) or (II) above (thereby constructively 
electing option (III)), amendment of the specification to renumber the pages consecutively and cancel incomplete 
sentences caused by any omitted page(s), and/or amendment of the specification to cancel all references to any 
omitted drawing(s), relabel the drawing figures to be numbered consecutively (if necessary), and correct the 
references in the specification to the drawing figures to correspond with any relabeled drawing figures, is required. 
A copy of the drawing figures showing the proposed changes in red ink should accompany with any drawing 
changes . Such amendment and/or correction to the drawing figures, if necessary, should be by way of 
preliminary amendment submitted prior to the first Office action to avoid delays in the prosecution of the 
application. 



The applicant needs to satisfy supplemental fees problems indicated below. 

The required item(s) identified below must be timely submitted to avoid abandonment: 

• Additional claim fees of S439 as a small entity, including any required multiple dependent claim fee, are 
required. Applicant must submit the additional claim fees or cancel the additional claims for which fees are due. 

SUMMARY OF FEES DUE: 

Total additional fee(s) required for this application is S504 for a Small Entity 
• S65 Late oath or declaration Surcharge. 

• Total additional claim fee(s) for this application is $439 

■ S258 for 1 5 independent claims over 3. 

■ S36 for 91 total claims over 20. 

■ S145 for multiple dependent claim surcharge. 

Replies should be mailed to: Mail Stop Missing Parts 

Commissioner for Patents 
P.O. Box 1450 
Alexandria VA 22313-1450 



A copy of this notice MUST be returned with the reply. 

i& i 

C itsMiner Service Center 

Initial Patent Examination Division (703) 308-1202 

PART 2 - COPY TO BE RETURNED WITH RESPONSE 



MAR 0 8 7(106 



Express Mail Label No.: EV45?428127l!S^ 
Date of Deposit: September 22. 2004 



Attorney Docket No. 21465-510 UTIL 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



APPLICANTS : John H. Leamon, et aL CONFIRMATION NUMBER : 5560 

SERIAL NUMBER : 10/767.779 Examiner: Not Yet Assigned 

FILING Date : January 28, 2004 ART UNIT : 1 632 

FOR : METHODS OF AMPLIFYING AND SEQUENCING NUCLEIC ACIDS 



MAIL STOP MISSING PARTS 

Commissioner for Patents 
P. O. Box 1450 
Alexandria, VA 22313-1450 

PRELIMINARY AMENDMENT TO ACCOMPANY 
SEQUENCE LISTING AND DRAWINGS 

Sir: 

To comply with the Notice to File Missing Parts of Nonprovisional Application mailed June 
22, 2004, Applicants submit herewith this Preliminary Amendment, Replacement Drawing Sheets, a 
Sequence Listing in paper and computer readable forms, and a Statement indicating that the paper and 
computer readable forms are identical. While Applicants believe that no additional fees are needed, 
the USPTO is authorized to charge or credit the undersigned Deposit Account No. 50 031 1, Reference 
No. 21465-510 UTIL, for any additional fees that are required. Applicants respectfully request entry 
and consideration of this Amendment in the application. 

Before Examining the above identified application, please amend the application as follows: 

Amendments to the Specification begin on page 2 of this paper. 
Amendments to the Drawings begin on page 3 of this paper. 
Remarks begin on page 4 of this paper. 



Express Mail Label No.: EV4?T4281271IS 
Date of Deposit: September 22, 2004 



Attorney Docket No. 21465-510 UT1L 



Amendments to the Spe cification: 

Please insert the attached Sequence Listing after page 161 of the specification. 

Please replace the paragraph on page 8. line 24 with the following paragraph: 
Figure 19 depicts a micrograph of single well reactions. Snlfurylase on mohile support with 
lucjferase on surface. PPi flows across FORA s urfa ce. One m obile support per pixel 
nn ramcra: 50 urn FORA well spann ^ four 15 urn pixels. Shown is 100 x 100 pixels 
(inK pixelsl. 1.5 mm 2 . 



2 



Express Mail Label No.: EV45T328127US *~ ,«„«cinirni 

Date of Deposit: September 22, 2004 Attorney Docket No. 21465-510 UT1L 

Amendments to the Drawings : 

The attached forty-nine (49) sheets of drawings include Figure 1 A - Figure 42. These sheets, 
which include Figure 1 A - Figure 42, replaces the original sheets including Figure 1A - Figure 42. 



Attachment: Replacement Sheets 



3 



Express Mail Label No.: EV45&28127US 
Date of Deposit: September 22, 2004 



Attorney Docket No. 21465-510 UTIL 



Remarks 



In accordance with the requirements of the Notice to File Missing Parts of Nonprovisional 
Application mailed June 22, 2004, Applicants have submitted herewith: 1) a Preliminary Amendment 
to include the Sequence Listing and Replacement Sheets for the drawings; 2) Replacement Sheets for 
Figure 1A - Figure 42; 3) a Sequence Listing in paper and computer readable forms; and 4) a 
Statement indicating that the paper and computer readable forms are identical. 

The Sequence Listing and amendments to the specification and drawings are supported by the 
application as originally filed, and do not contain new matter. The amended paragraph for page 8, line 
24 is supported by the text shown on Figure 19 as originally filed. The drawings for Figures 1 1C and 
1 ID are supported by original Figures 1 1C and 1 ID from U.S. Application No. 60/476,592, which was 
incorporated by reference in the instant application (see, inter alia, page 1, lines 11-12 and lines 25- 



A prompt allowance is respectfully requested. If any discussion regarding this Amendment is 
desired, the Examiner is respectfully urged to contact the undersigned at the number given below, and 
is assured of full cooperation in progressing the application to allowance. 



25). 



Conclusion 



Respectfully submitted, 



Dated: September 22, 2004 




Ivor Elrifi,Reg. No. 39,529 

Eric Sinn, Reg. No.40,177 

Caryn DeHoratius Reg. No. 45,881 

MINTZ, LEVIN, COHN, FERRIS, et. al 

The Chrysler Center 

666 Third Avenue, 24 th Floor 

New York, NY 10017 

Telephone: (212)935-3000 

Telefax: (212)983-3115 



NYC 300O40vl 
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SEQUENCE LISTING 

<110> Leamon, John H. 

Lohman, Kenton L. 
Rothberg, Jonathan M. 
weiner, Michael P. 

<120> METHOD OF AMPLIFYING AND SEQUENCING NUCLEIC ACIDS 

<130> 21465-510 UTIL 

<140> US 10/767,779 

<141> 2004-01-28 

<150> US 60/476,602 

<151> 2003-06-06 

<150> US 60/476,504 

<151> 2003-06-06 

<150> US 60/476,313 

<151> 2003-06-06 

<150> US 60/476,592 

<151> 2003-06-06 

<150> US 60/465,071 

<151> 2003-04-23 

<150> US 60/443,471 

<151> 2003-01-29 

<150> US 60/497,985 

<151> 2003-08-25 

<160> 60 

<170> Patentln version 3.2 

<210> 1 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 1 

cgtttcccct gtgtgccttg ccatctgttc cctccctgtc atgc 



<210> 2 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 2 

gcatgacagg gagggaacag atggcaaggc acacagggga 



<210> 3 



1 



<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 3 

gcatgacacg caacagggga tagggacacg cacgcaacag 40 

<210> 4 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 4 

ccatctgttg cgtgcgtgtc cctatcccct gttgcgtgtc atgc 44 

<210> 5 

<211> 64 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 5 

tcgtgtgagg tctcagcatc ttatgtatat ttacttctat tctcagttgc ctaagctgca 60 
gcca 64 

<210> 6 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 6 

gacctcacac gatggctgca gctt 24 

<210> 7 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 7 

gacctcacac gatggctgca gctt 24 



<210> 8 
<211> 64 
<212> DNA 



2 



<213> Artificial 
<220> 

<223> Probe 

<400> 8 - n 
tttatatgta ttctacgact ctggagtgtg ctaccgacgt cgaatccgtt gactcttatc 60 

ttca 64 

<210> 9 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 9 

ctagctcgta catataaatg aagataagat cctg 3* 

<210> 10 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 10 

gacctcacac gagtagcatg gctgcagctt 30 

<210> 11 

<211> 64 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 11 

tcgtgtgagg tctcagcatc ttatgtatat ttacttctat tctcagttgc ctaagctgca 60 



gcca 



<210> 12 

<211> 40 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



64 



<400> 12 

gcttacctga ccgacctctg cctatcccct gttgcgtgtc 40 



<210> 13 
<211> 40 
<212> DNA 



<213> Artificial 
<220> 

<223> Primer 
<400> 13 

ccattcccca gctcgtcttg ccatctgttc cctccctgtc 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 14 

gcttacctga ccgacctctg 



<210> 15 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 15 

ccattcccca gctcgtcttg 



<210> 16 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 16 

ccattcccca gctcgtcttg ccatctgttc cctccctgtc tcag 



<210> 17 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 17 

ccatctgttc cctccctgtc 



<210> 18 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<400> 18 

cctatcccct gttgcgtgtc 



<210> 19 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 19 

cgtttcccct gtgtgccttg 



<210> 20 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 20 

ccatctgttg cgtgcgtgtc 



<210> 21 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 21 

ccatctgttc cctccctgtc 



<210> 22 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 22 

cctatcccct gttgcgtgtc 



<210> 23 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 23 

ccatctgttg cgtgcgtgtc 



<210> 24 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<4O0> 24 

cgtttcccct gtgtgccttg 



<210> 25 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 25 

catcttgtcc actaggctct 



<210> 26 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 26 

ccatctgttg cgtgcgtgtc 



<210> 27 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 27 

accagcactc gcaccacc 



<210> 28 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 28 

cgtttcccct gtgtgccttg 



<210> 29 
<211> 18 
<212> DNA 



<213> Artificial 
<220> 

<223> Primer 
<400> 29 

tacctctccg cgtaggcg 



<210> 30 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 30 

ccatctgttg cgtgcgtgtc 



<210> 31 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 31 

ccccggacga gacgcag 



<210> 32 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 32 

atctctgcct actaaccatg aag 



<210> 33 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 33 

catcttgtcc actaggctct 



<210> 34 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<400> 34 

gtttctctcc agcctctcac cga 



<210> 35 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 35 

accagcactc gcaccacc 



<210> 36 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 36 

atctctgcct actaaccatg aag 



<210> 37 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 37 

tacctctccg cgtaggcg 



<210> 38 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 38 

gtttctctcc agcctctcac cga 



<210> 39 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 39 
ccccggacga gacgcag 



<210> 40 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 40 

cgtttcccct gtgtgccttg 



<210> 41 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 41 

ccatctgttg cgtgcgtgtc 



<210> 42 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 42 

ctagctagca tggaagcgcc agcagca 



<210> 43 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 43 

ccgggatccc tcgatgacga ccagcggc 



<210> 44 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 44 

atgcacatgg ttgacacagt ggt 



<210> 45 
<211> 22 
<212> DNA 
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FIG. 1A 
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. 1 B 
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° b Biotin 
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Gel isolation 



Method 1 
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FIG. I E 



Bind non-nicked dsDNA to streptavidin-coated — ssD.NA 
beads and sing le strand elution J 

Bind to Sepharose DNA Capture Bead (1 ssDNA per bead) 

I 

Deposition of a single DNA Capture Bead per well 
"FTP well | 

Amplification 
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Sequence 
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